Ishaan Jaff | 50461eb22c | feat - create budgets when team/member_add | 2024-05-22 17:16:19 -07:00
Ishaan Jaff | 31fc6d79af | Merge pull request #3785 from BerriAI/litellm_end_user_rate_limits; [Feat] LiteLLM Proxy: Enforce End-User TPM, RPM Limits | 2024-05-22 17:12:58 -07:00
Ishaan Jaff | a4cf453ad1 | fix - end user rate limiting tests | 2024-05-22 16:46:19 -07:00
Krrish Dholakia | 9f518030b9 | build(schema.prisma): add completion_start_time to spend logs; enables time to first token tracking for models | 2024-05-22 16:43:08 -07:00
Ishaan Jaff | 4175d00a24 | fix - test end user rate limits with master key | 2024-05-22 16:23:15 -07:00
Ishaan Jaff | a848a676af | docs - end user rate limiting | 2024-05-22 16:20:25 -07:00
Ishaan Jaff | 42078ac285 | fix - run tpm / rpm checks on proxy admin keys too | 2024-05-22 16:15:09 -07:00
Ishaan Jaff | e6b406d739 | feat - enforce end user tpm / rpm limits | 2024-05-22 15:45:30 -07:00
Ishaan Jaff | bef10f4b01 | test - end user tpm / rpm limiting | 2024-05-22 15:42:41 -07:00
Ishaan Jaff | 106910cecf | feat - add end user rate limiting | 2024-05-22 14:01:57 -07:00
Ishaan Jaff | eac7e70dca | feat - include litellm_budget table when getting end_user | 2024-05-22 13:30:08 -07:00
Krrish Dholakia | 9609df16d3 | feat(proxy_server.py): new 'add budget' endpoint; create a budget object (max budget, tpm /rpm limits, etc.) and assign that to a user/team/etc. | 2024-05-22 13:16:37 -07:00
Ishaan Jaff | 778b41cea5 | fix - update migration script to ensure api_key in script | 2024-05-22 12:22:23 -07:00
Ishaan Jaff | 0623eb0274 | Merge pull request #3783 from BerriAI/litellm_fix_dont_use_gunicorn_helm; [Fix] - don't use `gunicorn` on litellm helm | 2024-05-22 07:50:47 -07:00
Ishaan Jaff | 815dcd9187 | fix - don't use gunicorn on litellm helm | 2024-05-22 07:48:34 -07:00
Ishaan Jaff | 4e5be51373 | Merge pull request #3775 from BerriAI/litellm_add_failure_Callbacks_from_db; [Feat] add failure callbacks from DB to proxy | 2024-05-22 07:36:04 -07:00
Ishaan Jaff | b81fcf2482 | feat -add failure callbacks from DB to proxy | 2024-05-21 22:00:36 -07:00
Krish Dholakia | 5d7d638494 | Merge pull request #3770 from BerriAI/litellm_filter_invalid_params; feat(router.py): filter out deployments which don't support request params w/ 'pre_call_checks=True' | 2024-05-21 20:42:29 -07:00
Krish Dholakia | beb61702c3 | Merge branch 'main' into litellm_filter_invalid_params | 2024-05-21 20:42:21 -07:00
Ishaan Jaff | 908195610d | bump: version 1.37.19 → 1.37.20 | 2024-05-21 19:11:27 -07:00
Krish Dholakia | febd57dc81 | Merge pull request #3768 from BerriAI/litellm_lowest_latency_ttft_routing; feat(lowest_latency.py): route by time to first token, for streaming requests (if available) | 2024-05-21 19:11:02 -07:00
Krrish Dholakia | 0001b3208d | test(test_alangfuse.py): skip langfuse test | 2024-05-21 19:00:12 -07:00
Ishaan Jaff | e0cf055be6 | Merge pull request #3772 from BerriAI/litellm_create_keys_with_permissions; [Feat] Proxy - Create Keys that can only access `/spend` routes on Admin UI | 2024-05-21 18:53:49 -07:00
Ishaan Jaff | b513569a95 | docs - invite team members to access /spend routes | 2024-05-21 18:53:26 -07:00
Krrish Dholakia | 207924d08f | test(test_streaming.py): retry if openai is inconsistent with stream options | 2024-05-21 18:51:24 -07:00
Krrish Dholakia | f19d7327ca | fix(lowest_latency.py): set default none value for time_to_first_token in sync log success event | 2024-05-21 18:42:15 -07:00
Ishaan Jaff | 67b85d9dda | undo ui changes | 2024-05-21 18:39:25 -07:00
Krrish Dholakia | 20ad1a5189 | test(test_router_caching.py): fix caching tests | 2024-05-21 18:07:46 -07:00
Ishaan Jaff | 7cbdf02a9b | ui - set permissions on keys | 2024-05-21 18:06:56 -07:00
Krrish Dholakia | bfde1357dd | test(test_router_caching.py): fix test | 2024-05-21 18:05:05 -07:00
Ishaan Jaff | 64130e368c | feat - create keys with permissions | 2024-05-21 18:04:17 -07:00
Krish Dholakia | aae404fdab | Merge pull request #3771 from BerriAI/litellm_force_anthropic_tool_use; Support anthropic 'tool_choice' param | 2024-05-21 18:00:03 -07:00
Krrish Dholakia | af1d209f8f | docs(input.md): add anthropic tool choice support to docs | 2024-05-21 17:56:21 -07:00
Krrish Dholakia | f3d29a6b4a | feat(anthropic.py): support anthropic 'tool_choice' param; Closes https://github.com/BerriAI/litellm/issues/3752 | 2024-05-21 17:50:44 -07:00
Ishaan Jaff | 1d35b7543a | UI - create keys limited to specific routes | 2024-05-21 17:39:37 -07:00
Krrish Dholakia | 124fb18ce7 | test(test_logfire.py): skip logfire tests - breaking on ci/cd | 2024-05-21 17:37:16 -07:00
Krrish Dholakia | 8dede0c351 | test(test_logfire.py): skip logfire tests | 2024-05-21 17:36:17 -07:00
Krrish Dholakia | d5fd3095e4 | test(test_logfire.py): skip logfire tests | 2024-05-21 17:34:55 -07:00
Krrish Dholakia | cc41db018f | test(test_router.py): fix testing | 2024-05-21 17:31:31 -07:00
Krrish Dholakia | 988970f4c2 | feat(router.py): Fixes https://github.com/BerriAI/litellm/issues/3769 | 2024-05-21 17:24:51 -07:00
Krrish Dholakia | 62e5559e0a | fix(utils.py): filter out hf eos token; Closes https://github.com/BerriAI/litellm/issues/3757 | 2024-05-21 14:31:54 -07:00
Ishaan Jaff | 33a6647fac | fix don't log langfuse cache_hit in tags | 2024-05-21 14:18:53 -07:00
Krrish Dholakia | eea1edb973 | test(test_alangfuse.py): skip test for this pr | 2024-05-21 14:08:05 -07:00
Krrish Dholakia | 2b3da449c8 | feat(lowest_latency.py): route by time to first token, for streaming requests (if available); Closes https://github.com/BerriAI/litellm/issues/3574 | 2024-05-21 13:08:17 -07:00
Ishaan Jaff | 18a9d6cf2d | Merge pull request #3765 from BerriAI/litellm_bump_sdk; [Fix] Bump OpenAI version on Litellm PIP package [OpenAI>=1.27.0] | 2024-05-21 11:36:27 -07:00
Ishaan Jaff | 25812d3573 | bump openai version | 2024-05-21 11:33:57 -07:00
Ishaan Jaff | c9fd6dbb22 | feat - bump OpenAI version | 2024-05-21 11:32:33 -07:00
Ishaan Jaff | 42c18a68e9 | Merge pull request #3759 from BeatrixCohere/beatrix/CohereClient; [Cohere] Add request source to request | 2024-05-21 11:19:57 -07:00
Krrish Dholakia | 620e6db027 | build(model_prices_and_context_window.json): update azure/gpt-3.5-turbo base model pricing | 2024-05-21 10:58:16 -07:00
Ishaan Jaff | a7738c3b69 | Merge pull request #3763 from BerriAI/docs_add_meta_llama3; docs - add bedrock meta llama3 | 2024-05-21 10:39:26 -07:00