Commit graph

41 commits

Author SHA1 Message Date
Krrish Dholakia
3fdfb8470d fix(replicate.py): correctly handly invalid replicate response 2024-07-03 10:09:46 -07:00
Krrish Dholakia
efaffd84e1 fix(utils.py): catch 422-status errors 2024-06-24 19:41:48 -07:00
Krrish Dholakia
3896e3e88f fix: fix streaming with httpx client
prevent overwriting streams in parallel streaming calls
2024-05-31 10:55:18 -07:00
Krrish Dholakia
e41897808d fix(replicate.py): move replicate calls to being completely async
Closes https://github.com/BerriAI/litellm/issues/3128
2024-05-16 17:24:08 -07:00
Krrish Dholakia
5f93cae3ff feat(proxy_server.py): return litellm version in response headers 2024-05-08 16:00:08 -07:00
Krrish Dholakia
92bf686b10 fix(replicate.py): pass version if passed in 2024-04-26 17:11:21 -07:00
Krrish Dholakia
cf24e3eb02 fix(factory.py): add replicate meta llama prompt templating support 2024-04-25 08:25:00 -07:00
Krrish Dholakia
b10f03706d fix(utils.py): fix streaming to not return usage dict
Fixes https://github.com/BerriAI/litellm/issues/3237
2024-04-24 08:06:07 -07:00
Krrish Dholakia
76a298ec81 fix(replicate.py): handle empty response - replicate 2024-04-09 21:57:25 -07:00
Krrish Dholakia
61db2d775e fix(replicate.py): handle model endpoint 2024-03-05 19:06:24 -08:00
Krrish Dholakia
79978c44ba refactor: add black formatting 2023-12-25 14:11:20 +05:30
Krish Dholakia
9187faf4fd Merge pull request #1080 from nbaldwin98/fixing-replicate-sys-prompt
fix replicate system prompt: forgot to add **optional_params to input data
2023-12-11 07:11:52 -08:00
chabala98
1a7bd91ccb adfix: ded **optional_params in input data when system prompt is available (allows to papass other optional params apart from sys prompt) 2023-12-11 14:42:05 +01:00
Krish Dholakia
eab7d41cd3 Merge pull request #970 from nbaldwin98/fixing-replicate-sys-prompt
fix system prompts for replicate
2023-12-04 16:39:44 -08:00
ishaan-jaff
9506f66846 (feat) replicate/deployments: add POST Req view 2023-12-04 13:43:03 -08:00
chabala98
a8f2639e11 fix system prompts for replicate 2023-12-01 13:16:35 +01:00
Krrish Dholakia
a05722571b fix(replicate.py): fix custom prompt formatting 2023-11-29 19:44:09 -08:00
ishaan-jaff
7bc28f3b1c (fix) pydantic errors with response.time 2023-11-20 18:28:19 -08:00
Krrish Dholakia
4b74ddcb17 refactor: fixing linting issues 2023-11-11 18:52:28 -08:00
Krrish Dholakia
5efa3860da refactor(huggingface,-anthropic,-replicate,-sagemaker): making huggingface, anthropic, replicate, sagemaker compatible openai v1 sdk 2023-11-11 17:38:15 -08:00
ishaan-jaff
b50824de58 (feat) replicate add exception mapping for streaming + better logging when polling 2023-11-10 12:46:33 -08:00
ishaan-jaff
5fc73824bb (fix) replicate print verbose 2023-11-10 12:46:33 -08:00
ishaan-jaff
a6bb5ca8a8 (feat) replicate exception mapping 2023-11-10 12:46:33 -08:00
ishaan-jaff
78c93a0c5a (feat) replicate exception mapping 2023-11-10 12:46:33 -08:00
ishaan-jaff
c2b263e66d (feat) completion - replicate custom deployments 2023-11-09 18:10:38 -08:00
ishaan-jaff
485a7ff136 (feat) use usage class for model responses for cohere, hf, tg ai, cohere 2023-10-27 09:58:47 -07:00
Krrish Dholakia
18a6facdb3 fix: allow api base to be set for all providers
enables proxy use cases
2023-10-19 19:07:42 -07:00
Krrish Dholakia
cc0e4f4f9f fix: fix value error if model returns empty completion 2023-10-10 10:11:40 -07:00
Krrish Dholakia
69cdf5347a style(test_completion.py): fix merge conflict 2023-10-05 22:09:38 -07:00
Krrish Dholakia
4ce5c1d1dc fix replicate error 2023-10-02 21:35:16 -07:00
ishaan-jaff
39a77fed47 modify replicate polling interval 2023-10-02 07:43:38 -07:00
Krrish Dholakia
c9df59cc00 remove print statement in replicate.py 2023-09-27 10:43:06 -07:00
Krrish Dholakia
3cac04f459 fix exception mapping for streaming 2023-09-23 15:04:59 -07:00
ishaan-jaff
1eed36eb1d add replicate pricing 2023-09-06 18:14:34 -07:00
Krrish Dholakia
311bfb7bb7 logging replicate response logs 2023-09-06 11:28:40 -07:00
ishaan-jaff
a2d425f7de add flan + vicuna + fix replicate errors 2023-09-06 11:23:58 -07:00
Krrish Dholakia
8189a16188 updates 2023-09-06 11:21:48 -07:00
ishaan-jaff
89ebdab2b3 add replicate support for max_tokens 2023-09-06 10:38:21 -07:00
ishaan-jaff
d8dfa2d80d add Replicate Error class 2023-09-06 10:25:40 -07:00
ishaan-jaff
74e0e90620 add replicate streaming 2023-09-06 10:23:13 -07:00
ishaan-jaff
6fb01ec257 use replicate http requests instead 2023-09-06 09:43:05 -07:00