1543 commits

| Author | SHA1 | Message | Date |
|---|---|---|---|
| Krrish Dholakia | 323e095688 | fix exception mapping error | 2023-09-08 18:20:07 -07:00 |
| ishaan-jaff | 0ab62f13e8 | caching updates | 2023-09-08 18:06:47 -07:00 |
| ishaan-jaff | 4de44a691f | bump v | 2023-09-08 17:47:03 -07:00 |
| ishaan-jaff | 7158ea3ab8 | bump version to 0.1.564 | 2023-09-08 17:46:30 -07:00 |
| ishaan-jaff | c45e2ed48c | hosted vllm usage | 2023-09-08 13:58:06 -07:00 |
| ishaan-jaff | b0a2e57a8a | fix:completion fails when client=True&no email set | 2023-09-08 12:36:33 -07:00 |
| Krrish Dholakia | 554b05015e | fix litedebugger double logging error | 2023-09-07 18:02:24 -07:00 |
| Krrish Dholakia | 147a877aca | bump version | 2023-09-07 17:00:20 -07:00 |
| Krrish Dholakia | e452ceb21e | fix litellm client | 2023-09-07 16:22:00 -07:00 |
| ishaan-jaff | 8f5e2d0013 | bug fix - issue 277 | 2023-09-07 14:11:10 -07:00 |
| ishaan-jaff | 3e5c972e91 | bump version litellm | 2023-09-07 14:03:37 -07:00 |
| ishaan-jaff | 301eb36095 | cleanup async func test | 2023-09-07 13:56:46 -07:00 |
| ishaan-jaff | a611409e0f | async streaming generator | 2023-09-07 13:53:40 -07:00 |
| Krrish Dholakia | 35cf6ef0a1 | batch completions for vllm now works too | 2023-09-06 19:26:19 -07:00 |
| ishaan-jaff | 2880a7b6b4 | allow users to pass custom timing for replicate | 2023-09-06 18:32:40 -07:00 |
| ishaan-jaff | fc7ad0c245 | bump v | 2023-09-06 18:14:58 -07:00 |
| ishaan-jaff | 8b3b682000 | add replicate pricing | 2023-09-06 18:14:34 -07:00 |
| Krrish Dholakia | 4cfcabd919 | adding support for vllm | 2023-09-06 18:07:44 -07:00 |
| ishaan-jaff | bd77d5ac21 | docs update | 2023-09-06 17:16:24 -07:00 |
| ishaan-jaff | 1ba6b6761b | show pricing for tg ai completion | 2023-09-06 17:10:49 -07:00 |
| ishaan-jaff | bab27634a8 | rename max_tokens.json | 2023-09-06 16:28:17 -07:00 |
| ishaan-jaff | fbd67bc24c | add experimental together_computer cost calc | 2023-09-06 16:08:44 -07:00 |
| Krrish Dholakia | 0ace48d719 | update custom prompt template function | 2023-09-06 13:14:36 -07:00 |
| Krrish Dholakia | 4bcfa71e11 | commenting out flaky circle ci test | 2023-09-06 12:07:47 -07:00 |
| Krrish Dholakia | 91d2a57970 | update linting | 2023-09-06 11:30:40 -07:00 |
| ishaan-jaff | 1da6026622 | add flan + vicuna + fix replicate errors | 2023-09-06 11:23:58 -07:00 |
| Krrish Dholakia | 48ee4a08ac | updates | 2023-09-06 11:21:48 -07:00 |
| ishaan-jaff | af60b2ba77 | add vicuna translation | 2023-09-06 11:14:24 -07:00 |
| ishaan-jaff | d4c4a138ca | add replicate support for max_tokens | 2023-09-06 10:38:21 -07:00 |
| ishaan-jaff | 1c61b7b229 | add replicate streaming | 2023-09-06 10:23:13 -07:00 |
| ishaan-jaff | c45b132675 | use replicate http requests instead | 2023-09-06 09:43:05 -07:00 |
| ishaan-jaff | fb3983d058 | fix timeouts + tests + bump v | 2023-09-05 17:17:58 -07:00 |
| ishaan-jaff | 8ecef03f63 | custom api base | 2023-09-05 16:50:32 -07:00 |
| ishaan-jaff | 2250d1375e | clean up azure implementation | 2023-09-05 15:31:59 -07:00 |
| Krrish Dholakia | 9a82bfa4d2 | test parallelism on circle ci | 2023-09-05 14:47:50 -07:00 |
| Krrish Dholakia | faa78ad543 | bump version | 2023-09-05 14:30:03 -07:00 |
| Krrish Dholakia | 8845938b31 | adding support for custom prompt templates to together ai | 2023-09-05 12:20:09 -07:00 |
| Krrish Dholakia | 64f3d3c56e | prompt formatting for together ai llama2 models | 2023-09-05 11:57:13 -07:00 |
| Krrish Dholakia | dc130efac6 | update streaming docs to show it working for async completion calls | 2023-09-05 09:18:37 -07:00 |
| Krrish Dholakia | e2c143dfbc | test async streaming | 2023-09-04 15:42:24 -07:00 |
| ishaan-jaff | 79acfb4dab | fix aleph alpha client init | 2023-09-04 15:14:09 -07:00 |
| Krrish Dholakia | 2384806cfd | adding first-party + custom prompt templates for huggingface | 2023-09-04 14:54:09 -07:00 |
| ishaan-jaff | a474b89779 | clean up hugging face completion() | 2023-09-04 14:41:06 -07:00 |
| ishaan-jaff | f0e2922710 | bump version | 2023-09-04 12:44:20 -07:00 |
| ishaan-jaff | 8f07c80733 | boto3 testing + docs | 2023-09-04 12:14:24 -07:00 |
| ishaan-jaff | 126830f08a | allow users to set AWS_REGION_NAME | 2023-09-04 11:57:22 -07:00 |
| Krrish Dholakia | 73bb1b96e9 | update exception mapping and get model cost map | 2023-09-04 11:53:20 -07:00 |
| ishaan-jaff | e03d442e8f | add optional params for llama-2 | 2023-09-04 11:41:20 -07:00 |
| ishaan-jaff | 4a4ee51df3 | working sagemaker support | 2023-09-04 11:30:34 -07:00 |
| ishaan-jaff | 46857577fa | [temp] remove cache streaming flaky test | 2023-09-04 09:50:45 -07:00 |