11933 commits

Author SHA1 Message Date
Ishaan Jaff
50461eb22c feat - create budgets when team/member_add 2024-05-22 17:16:19 -07:00
Ishaan Jaff
31fc6d79af
Merge pull request #3785 from BerriAI/litellm_end_user_rate_limits
[Feat] LiteLLM Proxy: Enforce End-User TPM, RPM Limits
2024-05-22 17:12:58 -07:00
Ishaan Jaff
a4cf453ad1 fix - end user rate limiting tests 2024-05-22 16:46:19 -07:00
Krrish Dholakia
9f518030b9 build(schema.prisma): add completion_start_time to spend logs
enables time to first token tracking for models
2024-05-22 16:43:08 -07:00
Ishaan Jaff
4175d00a24 fix - test end user rate limits with master key 2024-05-22 16:23:15 -07:00
Ishaan Jaff
a848a676af docs - end user rate limiting 2024-05-22 16:20:25 -07:00
Ishaan Jaff
42078ac285 fix - run tpm / rpm checks on proxy admin keys too 2024-05-22 16:15:09 -07:00
Ishaan Jaff
e6b406d739 feat - enforce end user tpm / rpm limits 2024-05-22 15:45:30 -07:00
Ishaan Jaff
bef10f4b01 test - end user tpm / rpm limiting 2024-05-22 15:42:41 -07:00
Ishaan Jaff
106910cecf feat - add end user rate limiting 2024-05-22 14:01:57 -07:00
Ishaan Jaff
eac7e70dca feat - include litellm_budget table when getting end_user 2024-05-22 13:30:08 -07:00
Krrish Dholakia
9609df16d3 feat(proxy_server.py): new 'add budget' endpoint
create a budget object (max budget, tpm /rpm limits, etc.) and assign that to a user/team/etc.
2024-05-22 13:16:37 -07:00
Ishaan Jaff
778b41cea5 fix - update migration script to ensure api_key in script 2024-05-22 12:22:23 -07:00
Ishaan Jaff
0623eb0274
Merge pull request #3783 from BerriAI/litellm_fix_dont_use_gunicorn_helm
[Fix] - don't use `gunicorn` on litellm helm
2024-05-22 07:50:47 -07:00
Ishaan Jaff
815dcd9187 fix - don't use gunicorn on litellm helm 2024-05-22 07:48:34 -07:00
Ishaan Jaff
4e5be51373
Merge pull request #3775 from BerriAI/litellm_add_failure_Callbacks_from_db
[Feat] add failure callbacks from DB to proxy
2024-05-22 07:36:04 -07:00
Ishaan Jaff
b81fcf2482 feat -add failure callbacks from DB to proxy 2024-05-21 22:00:36 -07:00
Krish Dholakia
5d7d638494
Merge pull request #3770 from BerriAI/litellm_filter_invalid_params
feat(router.py): filter out deployments which don't support request params w/ 'pre_call_checks=True'
2024-05-21 20:42:29 -07:00
Krish Dholakia
beb61702c3
Merge branch 'main' into litellm_filter_invalid_params 2024-05-21 20:42:21 -07:00
Ishaan Jaff
908195610d bump: version 1.37.19 → 1.37.20 2024-05-21 19:11:27 -07:00
Krish Dholakia
febd57dc81
Merge pull request #3768 from BerriAI/litellm_lowest_latency_ttft_routing
feat(lowest_latency.py): route by time to first token, for streaming requests (if available)
2024-05-21 19:11:02 -07:00
Krrish Dholakia
0001b3208d test(test_alangfuse.py): skip langfuse test 2024-05-21 19:00:12 -07:00
Ishaan Jaff
e0cf055be6
Merge pull request #3772 from BerriAI/litellm_create_keys_with_permissions
[Feat] Proxy - Create Keys that can only access `/spend` routes on Admin UI
2024-05-21 18:53:49 -07:00
Ishaan Jaff
b513569a95 docs - invite team members to access /spend routes 2024-05-21 18:53:26 -07:00
Krrish Dholakia
207924d08f test(test_streaming.py): retry if openai is inconsistent with stream options 2024-05-21 18:51:24 -07:00
Krrish Dholakia
f19d7327ca fix(lowest_latency.py): set default none value for time_to_first_token in sync log success event 2024-05-21 18:42:15 -07:00
Ishaan Jaff
67b85d9dda undo ui changes 2024-05-21 18:39:25 -07:00
Krrish Dholakia
20ad1a5189 test(test_router_caching.py): fix caching tests 2024-05-21 18:07:46 -07:00
Ishaan Jaff
7cbdf02a9b ui - set permissions on keys 2024-05-21 18:06:56 -07:00
Krrish Dholakia
bfde1357dd test(test_router_caching.py): fix test 2024-05-21 18:05:05 -07:00
Ishaan Jaff
64130e368c feat - create keys with permissions 2024-05-21 18:04:17 -07:00
Krish Dholakia
aae404fdab
Merge pull request #3771 from BerriAI/litellm_force_anthropic_tool_use
Support anthropic 'tool_choice' param
2024-05-21 18:00:03 -07:00
Krrish Dholakia
af1d209f8f docs(input.md): add anthropic tool choice support to docs 2024-05-21 17:56:21 -07:00
Krrish Dholakia
f3d29a6b4a feat(anthropic.py): support anthropic 'tool_choice' param
Closes https://github.com/BerriAI/litellm/issues/3752
2024-05-21 17:50:44 -07:00
Ishaan Jaff
1d35b7543a UI - create keys limited to specific routes 2024-05-21 17:39:37 -07:00
Krrish Dholakia
124fb18ce7 test(test_logfire.py): skip logfire tests - breaking on ci/cd 2024-05-21 17:37:16 -07:00
Krrish Dholakia
8dede0c351 test(test_logfire.py): skip logfire tests 2024-05-21 17:36:17 -07:00
Krrish Dholakia
d5fd3095e4 test(test_logfire.py): skip logfire tests 2024-05-21 17:34:55 -07:00
Krrish Dholakia
cc41db018f test(test_router.py): fix testing 2024-05-21 17:31:31 -07:00
Krrish Dholakia
988970f4c2 feat(router.py): Fixes https://github.com/BerriAI/litellm/issues/3769 2024-05-21 17:24:51 -07:00
Krrish Dholakia
62e5559e0a fix(utils.py): filter out hf eos token
Closes https://github.com/BerriAI/litellm/issues/3757
2024-05-21 14:31:54 -07:00
Ishaan Jaff
33a6647fac fix don't log langfuse cache_hit in tags 2024-05-21 14:18:53 -07:00
Krrish Dholakia
eea1edb973 test(test_alangfuse.py): skip test for this pr 2024-05-21 14:08:05 -07:00
Krrish Dholakia
2b3da449c8 feat(lowest_latency.py): route by time to first token, for streaming requests (if available)
Closes https://github.com/BerriAI/litellm/issues/3574
2024-05-21 13:08:17 -07:00
Ishaan Jaff
18a9d6cf2d
Merge pull request #3765 from BerriAI/litellm_bump_sdk
[Fix] Bump OpenAI version on Litellm PIP package [OpenAI>=1.27.0]
2024-05-21 11:36:27 -07:00
Ishaan Jaff
25812d3573 bump openai version 2024-05-21 11:33:57 -07:00
Ishaan Jaff
c9fd6dbb22 feat - bump OpenAI version 2024-05-21 11:32:33 -07:00
Ishaan Jaff
42c18a68e9
Merge pull request #3759 from BeatrixCohere/beatrix/CohereClient
[Cohere] Add request source to request
2024-05-21 11:19:57 -07:00
Krrish Dholakia
620e6db027 build(model_prices_and_context_window.json): update azure/gpt-3.5-turbo base model pricing 2024-05-21 10:58:16 -07:00
Ishaan Jaff
a7738c3b69
Merge pull request #3763 from BerriAI/docs_add_meta_llama3
docs - add bedrock meta llama3
2024-05-21 10:39:26 -07:00