Commit graph

1456 commits

Author SHA1 Message Date
Krrish Dholakia
beecb60f51 update testing 2023-09-09 16:35:38 -07:00
Krrish Dholakia
8ed85b0523 rename apimanager to budget manager 2023-09-09 16:10:41 -07:00
Krrish Dholakia
a39756bfda add api manager 2023-09-09 15:55:38 -07:00
ishaan-jaff
5ab3ca2018 tests 2023-09-09 15:47:38 -07:00
ishaan-jaff
56bd8c1c52 olla upgrades, fix streaming, add non streaming resp 2023-09-09 14:07:13 -07:00
Krrish Dholakia
0609fd43d7 gorilla endpoint is flaky, commenting out for now 2023-09-09 12:48:11 -07:00
Krrish Dholakia
a9cab12a47 improve error message returned if model not passed in 2023-09-09 11:18:10 -07:00
ishaan-jaff
5ca8b23e22 bump version 2023-09-08 21:37:38 -07:00
ishaan-jaff
ba70ad766d fix redis caching test 2023-09-08 20:58:44 -07:00
Krrish Dholakia
c556a51139 fix test exceptions 2023-09-08 20:27:44 -07:00
ishaan-jaff
8180ba273b updating caching tests 2023-09-08 20:15:15 -07:00
Krrish Dholakia
59d6703b0c bump version 2023-09-08 19:26:41 -07:00
Krrish Dholakia
d02ab9bfcd update exception logic 2023-09-08 18:55:11 -07:00
Krrish Dholakia
323e095688 fix exception mapping error 2023-09-08 18:20:07 -07:00
ishaan-jaff
0ab62f13e8 caching updates 2023-09-08 18:06:47 -07:00
ishaan-jaff
4de44a691f bump v 2023-09-08 17:47:03 -07:00
ishaan-jaff
7158ea3ab8 bump version to 0.1.564 2023-09-08 17:46:30 -07:00
ishaan-jaff
c45e2ed48c hosted vllm usage 2023-09-08 13:58:06 -07:00
ishaan-jaff
b0a2e57a8a fix:completion fails when client=True&no email set 2023-09-08 12:36:33 -07:00
Krrish Dholakia
554b05015e fix litedebugger double logging error 2023-09-07 18:02:24 -07:00
Krrish Dholakia
147a877aca bump version 2023-09-07 17:00:20 -07:00
Krrish Dholakia
e452ceb21e fix litellm client 2023-09-07 16:22:00 -07:00
ishaan-jaff
8f5e2d0013 bug fix - issue 277 2023-09-07 14:11:10 -07:00
ishaan-jaff
3e5c972e91 bump version litellm 2023-09-07 14:03:37 -07:00
ishaan-jaff
301eb36095 cleanup async func test 2023-09-07 13:56:46 -07:00
ishaan-jaff
a611409e0f async streaming generator 2023-09-07 13:53:40 -07:00
Krrish Dholakia
35cf6ef0a1 batch completions for vllm now works too 2023-09-06 19:26:19 -07:00
ishaan-jaff
2880a7b6b4 allow users to pass custom timing for replicate 2023-09-06 18:32:40 -07:00
ishaan-jaff
fc7ad0c245 bump v 2023-09-06 18:14:58 -07:00
ishaan-jaff
8b3b682000 add replicate pricing 2023-09-06 18:14:34 -07:00
Krrish Dholakia
4cfcabd919 adding support for vllm 2023-09-06 18:07:44 -07:00
ishaan-jaff
bd77d5ac21 docs update 2023-09-06 17:16:24 -07:00
ishaan-jaff
1ba6b6761b show pricing for tg ai completion 2023-09-06 17:10:49 -07:00
ishaan-jaff
bab27634a8 rename max_tokens.json 2023-09-06 16:28:17 -07:00
ishaan-jaff
fbd67bc24c add experimental together_computer cost calc 2023-09-06 16:08:44 -07:00
Krrish Dholakia
0ace48d719 update custom prompt template function 2023-09-06 13:14:36 -07:00
Krrish Dholakia
4bcfa71e11 commenting out flaky circle ci test 2023-09-06 12:07:47 -07:00
Krrish Dholakia
91d2a57970 update linting 2023-09-06 11:30:40 -07:00
ishaan-jaff
1da6026622 add flan + vicuna + fix replicate errors 2023-09-06 11:23:58 -07:00
Krrish Dholakia
48ee4a08ac updates 2023-09-06 11:21:48 -07:00
ishaan-jaff
af60b2ba77 add vicuna translation 2023-09-06 11:14:24 -07:00
ishaan-jaff
d4c4a138ca add replicate support for max_tokens 2023-09-06 10:38:21 -07:00
ishaan-jaff
1c61b7b229 add replicate streaming 2023-09-06 10:23:13 -07:00
ishaan-jaff
c45b132675 use replicate http requests instead 2023-09-06 09:43:05 -07:00
ishaan-jaff
fb3983d058 fix timeouts + tests + bump v 2023-09-05 17:17:58 -07:00
ishaan-jaff
8ecef03f63 custom api base 2023-09-05 16:50:32 -07:00
ishaan-jaff
2250d1375e clean up azure implementation 2023-09-05 15:31:59 -07:00
Krrish Dholakia
9a82bfa4d2 test parallelism on circle ci 2023-09-05 14:47:50 -07:00
Krrish Dholakia
faa78ad543 bump version 2023-09-05 14:30:03 -07:00
Krrish Dholakia
8845938b31 adding support for custom prompt templates to together ai 2023-09-05 12:20:09 -07:00