Commit graph

10374 commits

Author SHA1 Message Date
Krrish Dholakia
76785cfb6a fix(utils.py): handle anthropic overloaded error 2024-08-08 17:18:19 -07:00
Krish Dholakia
4ef7743d71
Merge branch 'main' into litellm_auth_fix 2024-08-08 17:14:16 -07:00
Ishaan Jaff
b58664232c
Merge pull request #5114 from BerriAI/litellm_safe_get_file_checksum
[Fix-Bug]: Whisper is broken
2024-08-08 17:11:03 -07:00
Krrish Dholakia
da0413f0a4 fix(factory.py): handle openai function message having tool call id 2024-08-08 17:06:50 -07:00
Ishaan Jaff
e671ae58e3
Merge pull request #5119 from BerriAI/litellm_add_gemini_context_caching_litellm
[Feat-Proxy] Add Support for VertexAI context caching
2024-08-08 16:08:58 -07:00
Ishaan Jaff
75769538e7
Merge pull request #5115 from BerriAI/litellm_handle_case_when_prom_svc_logger_none
fix handle case when service logger has no attribute prometheusService
2024-08-08 16:08:47 -07:00
Ishaan Jaff
1924f11b4c add default_vertex_config 2024-08-08 16:07:26 -07:00
Ishaan Jaff
a3dd3a19fa docs cachedContent endpoint 2024-08-08 16:06:23 -07:00
Ishaan Jaff
cae941f4c0 auto update the model in cachedContents request 2024-08-08 15:56:46 -07:00
Krrish Dholakia
1bf8af5cb1 fix(anthropic.py): fix translation from /v1/messages format to openai format 2024-08-08 15:55:01 -07:00
Ishaan Jaff
0d662368ac add native cachedContents endpoint 2024-08-08 15:32:07 -07:00
Krrish Dholakia
e6a5d43ebc test: skip flaky langsmith tests 2024-08-08 13:44:15 -07:00
Krrish Dholakia
32f343b947 test: skip flaky langsmith tests 2024-08-08 13:43:44 -07:00
Krrish Dholakia
b65a27035e test(test_langsmith.py): skip flaky test 2024-08-08 13:42:15 -07:00
Krrish Dholakia
5da4c27e8d fix(internal_user_endpoints.py): expose new 'internal_user_budget_duration' flag
Relevant to - https://github.com/BerriAI/litellm/issues/5106
2024-08-08 13:05:03 -07:00
Krrish Dholakia
169e8b541c test(test_langsmith.py): fix test 2024-08-08 12:53:19 -07:00
Krrish Dholakia
ac601a69fc feat: set max_internal_budget for user w/ sso 2024-08-08 12:44:07 -07:00
Ishaan Jaff
909977ace6 add example using litellm proxy with gemini context caching 2024-08-08 11:35:41 -07:00
Ishaan Jaff
7b6110bbb3 add tests to make sure correct vertex ai route is used 2024-08-08 11:28:19 -07:00
Ishaan Jaff
d237e625ca use v1beta1 when using cached_content 2024-08-08 11:09:31 -07:00
Krrish Dholakia
5703da9b42 fix(user_api_key_auth.py): Fixes https://github.com/BerriAI/litellm/issues/5111 2024-08-08 10:30:15 -07:00
Krrish Dholakia
5d664c0441 build(model_prices_and_context_window.json): Fixes https://github.com/BerriAI/litellm/issues/5113 2024-08-08 09:11:59 -07:00
Krrish Dholakia
ee8d2f25b9 build: ui - update to include max budget per team 2024-08-08 09:09:23 -07:00
Ishaan Jaff
7d4e834091 fix handle case when service logger has no attribute prometheusServicesLogger 2024-08-08 08:23:29 -07:00
Ishaan Jaff
68a36600c2 fix use get_file_check_sum 2024-08-08 08:03:08 -07:00
prd-tuong-nguyen
3445174ebe feat: hash prompt when caching 2024-08-08 16:19:14 +07:00
Krrish Dholakia
a03860b1cf test(test_completion.py): fix merge error 2024-08-07 20:31:38 -07:00
Krish Dholakia
e1610d37b9
Merge pull request #5099 from BerriAI/litellm_personal_user_budgets
fix(user_api_key_auth.py): respect team budgets over user budget, if key belongs to team
2024-08-07 20:00:16 -07:00
Krish Dholakia
37ef63f522
Merge pull request #5101 from BerriAI/litellm_router_prioritization
feat(router.py): allows /chat/completion endpoint to work for request prioritization calls
2024-08-07 20:00:06 -07:00
Krish Dholakia
baf01b47d8
Merge branch 'main' into litellm_personal_user_budgets 2024-08-07 19:59:50 -07:00
Krrish Dholakia
ec0b511119 fix: use more descriptive flag 2024-08-07 18:59:46 -07:00
Krrish Dholakia
b7e31638fd fix(internal_user_endpoints.py): respect 'max_user_budget' for new internal user's 2024-08-07 18:50:40 -07:00
Ishaan Jaff
e17e959c62 image gen catch when predictions not in json response 2024-08-07 18:50:26 -07:00
Ishaan Jaff
e1a78faf98 run that ci cd again 2024-08-07 18:47:11 -07:00
Krrish Dholakia
b4392d2b67 test(test_amazing_vertex_completion.py): fix test for json schema validation in openai schema 2024-08-07 18:39:20 -07:00
Ishaan Jaff
b571f3e115 run that ci/cd again 2024-08-07 18:25:52 -07:00
Krrish Dholakia
4e02425d8b fix(vertex_ai_partner.py): pass model for llama3 param mapping 2024-08-07 18:16:41 -07:00
Ishaan Jaff
75bfe81748 fix all optional param tests 2024-08-07 18:16:41 -07:00
Ishaan Jaff
472a1df76f fix test for wildcard routing 2024-08-07 18:16:41 -07:00
Ishaan Jaff
4c375d13d6 fix test_drop_params_parallel_tool_calls 2024-08-07 18:16:41 -07:00
Krrish Dholakia
a15317a377 fix(vertex_ai_partner.py): pass model for llama3 param mapping 2024-08-07 18:07:14 -07:00
Ishaan Jaff
4e90a25417 fix all optional param tests 2024-08-07 17:52:40 -07:00
Krrish Dholakia
400653992c feat(router.py): allow using .acompletion() for request prioritization
allows /chat/completion endpoint to work for request prioritization calls
2024-08-07 16:43:12 -07:00
Ishaan Jaff
6b03998e32 fix test for wildcard routing 2024-08-07 16:41:00 -07:00
Ishaan Jaff
341e098c32 fix test_drop_params_parallel_tool_calls 2024-08-07 16:26:56 -07:00
Ishaan Jaff
e585dfba92 docs prom 2024-08-07 16:03:11 -07:00
Krrish Dholakia
639f84d609 test(test_completion.py): handle gemini instability 2024-08-07 15:58:47 -07:00
Ishaan Jaff
89050dc4f4 ci/cd run again 2024-08-07 15:44:54 -07:00
Ishaan Jaff
2c3e068435 fix test_team_update_redis 2024-08-07 15:37:02 -07:00
Krrish Dholakia
f76261af35 refactor(user_api_key_auth.py): refactor to replace user_id_information list with pydantic user_obj
Allows using the 'get_user_object' function in user_api_key_auth, keeping it consistent across jwt-auth and key-auth
2024-08-07 15:33:55 -07:00