Krrish Dholakia
|
e41897808d
|
fix(replicate.py): move replicate calls to being completely async
Closes https://github.com/BerriAI/litellm/issues/3128
|
2024-05-16 17:24:08 -07:00 |
|
Krrish Dholakia
|
5f93cae3ff
|
feat(proxy_server.py): return litellm version in response headers
|
2024-05-08 16:00:08 -07:00 |
|
Krrish Dholakia
|
92bf686b10
|
fix(replicate.py): pass version if passed in
|
2024-04-26 17:11:21 -07:00 |
|
Krrish Dholakia
|
cf24e3eb02
|
fix(factory.py): add replicate meta llama prompt templating support
|
2024-04-25 08:25:00 -07:00 |
|
Krrish Dholakia
|
b10f03706d
|
fix(utils.py): fix streaming to not return usage dict
Fixes https://github.com/BerriAI/litellm/issues/3237
|
2024-04-24 08:06:07 -07:00 |
|
Krrish Dholakia
|
76a298ec81
|
fix(replicate.py): handle empty response - replicate
|
2024-04-09 21:57:25 -07:00 |
|
Krrish Dholakia
|
61db2d775e
|
fix(replicate.py): handle model endpoint
|
2024-03-05 19:06:24 -08:00 |
|
Krrish Dholakia
|
79978c44ba
|
refactor: add black formatting
|
2023-12-25 14:11:20 +05:30 |
|
Krish Dholakia
|
9187faf4fd
|
Merge pull request #1080 from nbaldwin98/fixing-replicate-sys-prompt
fix replicate system prompt: forgot to add **optional_params to input data
|
2023-12-11 07:11:52 -08:00 |
|
chabala98
|
1a7bd91ccb
|
fix: added **optional_params in input data when system prompt is available (allows to pass other optional params apart from sys prompt)
|
2023-12-11 14:42:05 +01:00 |
|
Krish Dholakia
|
eab7d41cd3
|
Merge pull request #970 from nbaldwin98/fixing-replicate-sys-prompt
fix system prompts for replicate
|
2023-12-04 16:39:44 -08:00 |
|
ishaan-jaff
|
9506f66846
|
(feat) replicate/deployments: add POST Req view
|
2023-12-04 13:43:03 -08:00 |
|
chabala98
|
a8f2639e11
|
fix system prompts for replicate
|
2023-12-01 13:16:35 +01:00 |
|
Krrish Dholakia
|
a05722571b
|
fix(replicate.py): fix custom prompt formatting
|
2023-11-29 19:44:09 -08:00 |
|
ishaan-jaff
|
7bc28f3b1c
|
(fix) pydantic errors with response.time
|
2023-11-20 18:28:19 -08:00 |
|
Krrish Dholakia
|
4b74ddcb17
|
refactor: fixing linting issues
|
2023-11-11 18:52:28 -08:00 |
|
Krrish Dholakia
|
5efa3860da
|
refactor(huggingface,-anthropic,-replicate,-sagemaker): making huggingface, anthropic, replicate, sagemaker compatible openai v1 sdk
|
2023-11-11 17:38:15 -08:00 |
|
ishaan-jaff
|
b50824de58
|
(feat) replicate add exception mapping for streaming + better logging when polling
|
2023-11-10 12:46:33 -08:00 |
|
ishaan-jaff
|
5fc73824bb
|
(fix) replicate print verbose
|
2023-11-10 12:46:33 -08:00 |
|
ishaan-jaff
|
a6bb5ca8a8
|
(feat) replicate exception mapping
|
2023-11-10 12:46:33 -08:00 |
|
ishaan-jaff
|
78c93a0c5a
|
(feat) replicate exception mapping
|
2023-11-10 12:46:33 -08:00 |
|
ishaan-jaff
|
c2b263e66d
|
(feat) completion - replicate custom deployments
|
2023-11-09 18:10:38 -08:00 |
|
ishaan-jaff
|
485a7ff136
|
(feat) use usage class for model responses for cohere, hf, tg ai, cohere
|
2023-10-27 09:58:47 -07:00 |
|
Krrish Dholakia
|
18a6facdb3
|
fix: allow api base to be set for all providers
enables proxy use cases
|
2023-10-19 19:07:42 -07:00 |
|
Krrish Dholakia
|
cc0e4f4f9f
|
fix: fix value error if model returns empty completion
|
2023-10-10 10:11:40 -07:00 |
|
Krrish Dholakia
|
69cdf5347a
|
style(test_completion.py): fix merge conflict
|
2023-10-05 22:09:38 -07:00 |
|
Krrish Dholakia
|
4ce5c1d1dc
|
fix replicate error
|
2023-10-02 21:35:16 -07:00 |
|
ishaan-jaff
|
39a77fed47
|
modify replicate polling interval
|
2023-10-02 07:43:38 -07:00 |
|
Krrish Dholakia
|
c9df59cc00
|
remove print statement in replicate.py
|
2023-09-27 10:43:06 -07:00 |
|
Krrish Dholakia
|
3cac04f459
|
fix exception mapping for streaming
|
2023-09-23 15:04:59 -07:00 |
|
ishaan-jaff
|
1eed36eb1d
|
add replicate pricing
|
2023-09-06 18:14:34 -07:00 |
|
Krrish Dholakia
|
311bfb7bb7
|
logging replicate response logs
|
2023-09-06 11:28:40 -07:00 |
|
ishaan-jaff
|
a2d425f7de
|
add flan + vicuna + fix replicate errors
|
2023-09-06 11:23:58 -07:00 |
|
Krrish Dholakia
|
8189a16188
|
updates
|
2023-09-06 11:21:48 -07:00 |
|
ishaan-jaff
|
89ebdab2b3
|
add replicate support for max_tokens
|
2023-09-06 10:38:21 -07:00 |
|
ishaan-jaff
|
d8dfa2d80d
|
add Replicate Error class
|
2023-09-06 10:25:40 -07:00 |
|
ishaan-jaff
|
74e0e90620
|
add replicate streaming
|
2023-09-06 10:23:13 -07:00 |
|
ishaan-jaff
|
6fb01ec257
|
use replicate http requests instead
|
2023-09-06 09:43:05 -07:00 |
|