Commit graph

9378 commits

Author SHA1 Message Date
Ishaan Jaff
d04b4dea3e (docs) best prod practices 2024-03-23 19:29:21 -07:00
Krrish Dholakia
e8e7964025 docs(routing.md): add pre-call checks to docs 2024-03-23 19:10:34 -07:00
Krrish Dholakia
b7321ae4ee fix(router.py): fix pre call check logic 2024-03-23 18:56:08 -07:00
Ishaan Jaff
2ae489c506 (docs) update config set_verbose 2024-03-23 18:54:31 -07:00
Ishaan Jaff
04a09830de
Merge pull request #2668 from BerriAI/litellm_update_deploy_docs
[Docs] Add Docs on deploying to EKS Cluster + K8
2024-03-23 18:42:36 -07:00
Krrish Dholakia
9f83cf95c6 bump: version 1.33.8 → 1.33.9 2024-03-23 18:26:52 -07:00
Krrish Dholakia
eb3ca85d7e feat(router.py): enable pre-call checks
filter models outside of context window limits of a given message for a model group

 https://github.com/BerriAI/litellm/issues/872
2024-03-23 18:03:30 -07:00
Ishaan Jaff
925e785a6a (fix) update locustfile 2024-03-23 17:59:54 -07:00
Ishaan Jaff
0c17fe9a32 (feat) update config.yaml 2024-03-23 17:59:16 -07:00
Ishaan Jaff
b0c75d405c (feat) update kub.yaml 2024-03-23 17:58:39 -07:00
Ishaan Jaff
30ae52c21e (docs) using litellm on EKS 2024-03-23 17:49:00 -07:00
Ishaan Jaff
f646a4612b
Merge pull request #2667 from BerriAI/litellm_update_gunicorn_instructions
[Docs] update gunicorn instructions - Uvicorn perf is significantly better on K8s
2024-03-23 17:43:09 -07:00
Ishaan Jaff
19a1d999ec (feat) update docs to not include gunicorn usage 2024-03-23 17:40:22 -07:00
Ishaan Jaff
61d2e91632 (docs) update gunicorn usage 2024-03-23 17:39:07 -07:00
Krrish Dholakia
2fabff06c0 fix(bedrock.py): fix supported openai params for bedrock claude 3 2024-03-23 16:02:15 -07:00
Krrish Dholakia
c81c9c2583 fix(proxy_server.py): fix model info check 2024-03-23 15:59:17 -07:00
Krish Dholakia
5bcf92f4f5
Merge pull request #2659 from vivek-athina/athina-doc-updates-and-a-minor-fix
Athina docs updated with information about additional fields and a minor fix in the callback
2024-03-23 15:54:32 -07:00
Krrish Dholakia
2a9fd4c28d test(test_completion.py): make default claude 3 test message multi-turn 2024-03-23 14:34:42 -07:00
Dragos Campean
a016a6bfd8 Update poetry.lock after deps update from main merge 2024-03-23 21:40:46 +02:00
Dragos Campean
9203eae056 Merge remote-tracking branch 'upstream/main' into fix-pip-install-extra-proxy 2024-03-23 21:37:48 +02:00
Krrish Dholakia
05dfc9e6ac bump: version 1.33.7 → 1.33.8 2024-03-23 10:45:30 -07:00
Krrish Dholakia
8bd626197a Revert "bump: version 1.33.7 → 1.33.7.dev1"
This reverts commit 1b24beb7c7.
2024-03-23 10:43:12 -07:00
Krrish Dholakia
f0bee037ad build(test_python_38.py): add testing for litellm cli import 2024-03-23 10:42:19 -07:00
Krrish Dholakia
1b24beb7c7 bump: version 1.33.7 → 1.33.7.dev1 2024-03-23 10:31:38 -07:00
Krrish Dholakia
24204cec6b build(pyproject.toml): fix dependency issue re: cryptography 2024-03-23 10:31:31 -07:00
Dragos Campean
f38d8cf88e Fix pip install 'litellm[extra_proxy]' 2024-03-23 17:32:57 +02:00
Krrish Dholakia
9b951b906d test(test_completion.py): fix claude multi-turn conversation test 2024-03-23 00:56:41 -07:00
Krrish Dholakia
b9143a0a00 fix(factory.py): fix anthropic check 2024-03-23 00:27:24 -07:00
Vivek Aditya
efc90b04c7 minor fix 2024-03-23 12:50:46 +05:30
Vivek Aditya
6bd49c6087 Athina docs updated with information about additional fields and a minor fix in the callback 2024-03-23 12:42:07 +05:30
Ishaan Jaff
68fd487c2e bump: version 1.33.6 → 1.33.7 2024-03-22 22:16:14 -07:00
Ishaan Jaff
3639b93d9f bump: version 1.33.5 → 1.33.6 2024-03-22 22:16:12 -07:00
Ishaan Jaff
5ae84d13ba
Merge pull request #2657 from BerriAI/litellm_improve_perf
(feat) remove litellm.telemetry - improve perf on EC2 tiny machines by 90%
2024-03-22 22:15:41 -07:00
Tasha Upchurch
ab919004a2
Update utils.py
fix for constructed from dict choices.message being a dict still instead of Message class.
2024-03-23 00:12:24 -04:00
Ishaan Jaff
f39f606e02 (feat) remove litellm.telemetry 2024-03-22 20:58:14 -07:00
Tasha Upchurch
79201449d2
Update utils.py
Fix for creating an empty choices if no choices passed in
2024-03-22 23:39:17 -04:00
Tasha Upchurch
9201c122e7
Update utils.py
fix for #2655
2024-03-22 23:13:24 -04:00
Krrish Dholakia
42a7588b04 fix(anthropic.py): support async claude 3 tool calling + streaming
https://github.com/BerriAI/litellm/issues/2644
2024-03-22 19:57:01 -07:00
Ishaan Jaff
2e284a0cfe
Merge pull request #2653 from BerriAI/litellm_add_example_kub_yamls
[FEAT] Add example Kubernetes + Service YAML Files
2024-03-22 19:49:11 -07:00
Ishaan Jaff
aca6ec85e2 (fix) add some better load testing 2024-03-22 19:48:54 -07:00
Ishaan Jaff
28e62af4e1 (fix) update load test used 2024-03-22 19:48:54 -07:00
Ishaan Jaff
9c483dbae4 (feat) add sample kubernetes for litellm 2024-03-22 19:47:44 -07:00
Ishaan Jaff
311918b99c (fix) add some better load testing 2024-03-22 19:45:24 -07:00
Ishaan Jaff
48b9250a3d (fix) update load test used 2024-03-22 19:44:16 -07:00
Krish Dholakia
e2d81722d2
Merge pull request #2650 from BerriAI/litellm_jwt_auth_fixes
feat(handle_jwt.py): enable jwt-project based auth
2024-03-22 19:32:46 -07:00
Krrish Dholakia
90465ff00a bump: version 1.33.4 → 1.33.5 2024-03-22 18:15:42 -07:00
Krrish Dholakia
691a83b7dc fix(anthropic.py): handle multiple system prompts 2024-03-22 18:14:15 -07:00
Krrish Dholakia
265dd5cd4f docs(token_auth.md): add project based auth to docs 2024-03-22 17:27:40 -07:00
Krrish Dholakia
d06b9a5a47 fix(proxy_server.py): enable jwt-auth for users
allow a user to auth into the proxy via jwt's and call allowed routes
2024-03-22 17:08:10 -07:00
Krrish Dholakia
9bf086386e fix(handle_jwt.py): add more logging for jwt header 2024-03-22 16:33:32 -07:00