Commit graph

3832 commits

Author SHA1 Message Date
ishaan-jaff
2c74dbed17 (chore) util: remove_model_id 2023-11-29 17:30:33 -08:00
ishaan-jaff
7a38a45d62 (test) test weighted selection router 2023-11-29 17:30:18 -08:00
ishaan-jaff
48416f8018 (test) add rpm to load test profiling 2023-11-29 17:14:34 -08:00
ishaan-jaff
088d2bc081 (fix) use weighted shuffle when rpm set 2023-11-29 17:13:11 -08:00
Krrish Dholakia
38efc21f81 bump: version 1.7.18 → 1.7.19 2023-11-29 16:50:11 -08:00
Krrish Dholakia
61185aa12c fix(main.py): fix null finish reason issue for ollama 2023-11-29 16:50:11 -08:00
ishaan-jaff
69eca78000 (docs) simple proxy 2023-11-29 16:44:40 -08:00
Krrish Dholakia
c2f642dbec bump: version 1.7.17 → 1.7.18 2023-11-29 16:43:11 -08:00
Krrish Dholakia
5411d5a6fd fix(utils.py): raise stop iteration exception on bedrock stream close 2023-11-29 16:43:11 -08:00
Ishaan Jaff
286ce586be
Update README.md 2023-11-29 16:40:16 -08:00
ishaan-jaff
4b78481fbd (docs) simple proxy 2023-11-29 16:38:36 -08:00
ishaan-jaff
2d0432c5b7 (docs) simple proxy 2023-11-29 16:36:07 -08:00
Krrish Dholakia
52c9159a54 bump: version 1.7.16 → 1.7.17 2023-11-29 16:35:06 -08:00
Krrish Dholakia
ab76daa90b fix(bedrock.py): support ai21 / bedrock streaming 2023-11-29 16:35:06 -08:00
ishaan-jaff
3b89cff65e (docs) simple proxy 2023-11-29 16:33:00 -08:00
ishaan-jaff
032cd0121b (docs) simple proxy 2023-11-29 16:31:08 -08:00
ishaan-jaff
3cc8305ec6 (fix) proxy: /health 2023-11-29 16:23:37 -08:00
ishaan-jaff
d3672452ce (test) 1k requests 2023-11-29 16:22:18 -08:00
ishaan-jaff
3c6764efef (feat) proxy+ router: support 1k request/second 2023-11-29 16:22:04 -08:00
ishaan-jaff
da75b15176 (feat) completion: add rpm, tpm as litellm params 2023-11-29 16:19:05 -08:00
ishaan-jaff
9bf603889f (fix) azure: remove max retries before completion 2023-11-29 16:09:31 -08:00
ishaan-jaff
66bc0fc343 (fix) proxy: /health works with router updates 2023-11-29 16:09:31 -08:00
ishaan-jaff
8a398a1777 (feat) proxy: add weighted shuffle + set cooldown to 1s 2023-11-29 16:09:31 -08:00
ishaan-jaff
2bbd9c063d (fix) OpenAI embedding 2023-11-29 16:09:31 -08:00
Krrish Dholakia
3c254cc555 bump: version 1.7.15 → 1.7.16 2023-11-29 16:04:22 -08:00
Krrish Dholakia
96a27ce954 fix(utils.py): stop sequence filtering for amazon titan models 2023-11-29 16:04:14 -08:00
Krrish Dholakia
dccfe1cc3e bump: version 1.7.14 → 1.7.15 2023-11-29 15:37:21 -08:00
Krrish Dholakia
04a1c20bc5 fix(router.py): skip api key when generating model id for router deployments 2023-11-29 15:37:08 -08:00
Krrish Dholakia
6c98715b94 bump: version 1.7.13 → 1.7.14 2023-11-29 15:19:18 -08:00
Krrish Dholakia
451851e6a4 fix(main.py): have stream_chunk_builder return successful response even if token_counter fails 2023-11-29 15:19:11 -08:00
Krrish Dholakia
e41bd77df4 bump: version 1.7.12 → 1.7.13 2023-11-29 12:22:27 -08:00
Krrish Dholakia
7175cac1de test(test_custom_logger.py): comment out streaming/success test due to pytest build issues 2023-11-29 12:18:02 -08:00
Krrish Dholakia
2b437a2699 fix(utils.py): return last streaming chunk 2023-11-29 12:11:08 -08:00
Krrish Dholakia
a9ed768991 fix(azure.py): fix error handling for openai/azure streaming 2023-11-29 11:52:24 -08:00
Krrish Dholakia
b6bc75e27a fix(utils.py): fix parallel tool calling when streaming 2023-11-29 10:56:21 -08:00
Krrish Dholakia
9024a47dc2 fix(utils.py): bedrock/cohere optional params 2023-11-29 08:08:48 -08:00
ishaan-jaff
f307e82a41 (fix) proxy: making receiving data print_verbose 2023-11-29 07:50:52 -08:00
Krrish Dholakia
5fc7cd28d9 bump: version 1.7.11 → 1.7.12 2023-11-28 21:34:56 -08:00
ishaan-jaff
6f71299bb0 (fix) embedding pop out client from params 2023-11-28 21:22:01 -08:00
Krrish Dholakia
383dd53e86 fix(main.py): passing client as a litellm-specific kwarg 2023-11-28 21:20:05 -08:00
Krrish Dholakia
3154cd1758 test(test_exceptions.py): cleanup testing 2023-11-28 21:11:49 -08:00
ishaan-jaff
4c4dde60c7 (ci/cd) test exceptions 2023-11-28 21:11:12 -08:00
ishaan-jaff
afd20098be (feat) router: init client for OpenAI compatible providers 2023-11-28 17:49:53 -08:00
Krrish Dholakia
b9ae6275ca bump: version 1.7.10 → 1.7.11 2023-11-28 17:42:57 -08:00
Krrish Dholakia
4e9aa0d338 fix(utils.py): fix bedrock/cohere supported params 2023-11-28 17:42:50 -08:00
Krrish Dholakia
2b06fea4a8 fix(proxy_server.py): ensure /models returns unique model names 2023-11-28 17:32:20 -08:00
Krrish Dholakia
9d4e427250 refactor(requirements.txt): adding google-genai for proxy requirements 2023-11-28 17:27:33 -08:00
Krrish Dholakia
536abe745e bump: version 1.7.9 → 1.7.10 2023-11-28 17:25:03 -08:00
Krrish Dholakia
bb1267eb07 fix(router.py): fix exponential backoff to use retry-after if present in headers 2023-11-28 17:25:03 -08:00
ishaan-jaff
0f0ddcc0fb (fix) using AzureOpenAI client 2023-11-28 17:17:40 -08:00