| Krrish Dholakia | 76785cfb6a | fix(utils.py): handle anthropic overloaded error | 2024-08-08 17:18:19 -07:00 |
| Krish Dholakia | 4ef7743d71 | Merge branch 'main' into litellm_auth_fix | 2024-08-08 17:14:16 -07:00 |
| Ishaan Jaff | b58664232c | Merge pull request #5114 from BerriAI/litellm_safe_get_file_checksum; [Fix-Bug]: Whisper is broken | 2024-08-08 17:11:03 -07:00 |
| Krrish Dholakia | da0413f0a4 | fix(factory.py): handle openai function message having tool call id | 2024-08-08 17:06:50 -07:00 |
| Ishaan Jaff | e671ae58e3 | Merge pull request #5119 from BerriAI/litellm_add_gemini_context_caching_litellm; [Feat-Proxy] Add Support for VertexAI context caching | 2024-08-08 16:08:58 -07:00 |
| Ishaan Jaff | 75769538e7 | Merge pull request #5115 from BerriAI/litellm_handle_case_when_prom_svc_logger_none; fix handle case when service logger has no attribute prometheusService | 2024-08-08 16:08:47 -07:00 |
| Ishaan Jaff | 1924f11b4c | add default_vertex_config | 2024-08-08 16:07:26 -07:00 |
| Ishaan Jaff | a3dd3a19fa | docs cachedContent endpoint | 2024-08-08 16:06:23 -07:00 |
| Ishaan Jaff | cae941f4c0 | auto update the model in cachedContents request | 2024-08-08 15:56:46 -07:00 |
| Krrish Dholakia | 1bf8af5cb1 | fix(anthropic.py): fix translation from /v1/messages format to openai format | 2024-08-08 15:55:01 -07:00 |
| Ishaan Jaff | 0d662368ac | add native cachedContents endpoint | 2024-08-08 15:32:07 -07:00 |
| Krrish Dholakia | e6a5d43ebc | test: skip flaky langsmith tests | 2024-08-08 13:44:15 -07:00 |
| Krrish Dholakia | 32f343b947 | test: skip flaky langsmith tests | 2024-08-08 13:43:44 -07:00 |
| Krrish Dholakia | b65a27035e | test(test_langsmith.py): skip flaky test | 2024-08-08 13:42:15 -07:00 |
| Krrish Dholakia | 5da4c27e8d | fix(internal_user_endpoints.py): expose new 'internal_user_budget_duration' flag; Relevant to - https://github.com/BerriAI/litellm/issues/5106 | 2024-08-08 13:05:03 -07:00 |
| Krrish Dholakia | 169e8b541c | test(test_langsmith.py): fix test | 2024-08-08 12:53:19 -07:00 |
| Krrish Dholakia | ac601a69fc | feat: set max_internal_budget for user w/ sso | 2024-08-08 12:44:07 -07:00 |
| Ishaan Jaff | 909977ace6 | add example using litellm proxy with gemini context caching | 2024-08-08 11:35:41 -07:00 |
| Ishaan Jaff | 7b6110bbb3 | add tests to make sure correct vertex ai route is used | 2024-08-08 11:28:19 -07:00 |
| Ishaan Jaff | d237e625ca | use v1beta1 when using cached_content | 2024-08-08 11:09:31 -07:00 |
| Krrish Dholakia | 5703da9b42 | fix(user_api_key_auth.py): Fixes https://github.com/BerriAI/litellm/issues/5111 | 2024-08-08 10:30:15 -07:00 |
| Krrish Dholakia | 5d664c0441 | build(model_prices_and_context_window.json): Fixes https://github.com/BerriAI/litellm/issues/5113 | 2024-08-08 09:11:59 -07:00 |
| Krrish Dholakia | ee8d2f25b9 | build: ui - update to include max budget per team | 2024-08-08 09:09:23 -07:00 |
| Ishaan Jaff | 7d4e834091 | fix handle case when service logger has no attribute prometheusServicesLogger | 2024-08-08 08:23:29 -07:00 |
| Ishaan Jaff | 68a36600c2 | fix use get_file_check_sum | 2024-08-08 08:03:08 -07:00 |
| prd-tuong-nguyen | 3445174ebe | feat: hash prompt when caching | 2024-08-08 16:19:14 +07:00 |
| Krrish Dholakia | a03860b1cf | test(test_completion.py): fix merge error | 2024-08-07 20:31:38 -07:00 |
| Krish Dholakia | e1610d37b9 | Merge pull request #5099 from BerriAI/litellm_personal_user_budgets; fix(user_api_key_auth.py): respect team budgets over user budget, if key belongs to team | 2024-08-07 20:00:16 -07:00 |
| Krish Dholakia | 37ef63f522 | Merge pull request #5101 from BerriAI/litellm_router_prioritization; feat(router.py): allows /chat/completion endpoint to work for request prioritization calls | 2024-08-07 20:00:06 -07:00 |
| Krish Dholakia | baf01b47d8 | Merge branch 'main' into litellm_personal_user_budgets | 2024-08-07 19:59:50 -07:00 |
| Krrish Dholakia | ec0b511119 | fix: use more descriptive flag | 2024-08-07 18:59:46 -07:00 |
| Krrish Dholakia | b7e31638fd | fix(internal_user_endpoints.py): respect 'max_user_budget' for new internal user's | 2024-08-07 18:50:40 -07:00 |
| Ishaan Jaff | e17e959c62 | image gen catch when predictions not in json response | 2024-08-07 18:50:26 -07:00 |
| Ishaan Jaff | e1a78faf98 | run that ci cd again | 2024-08-07 18:47:11 -07:00 |
| Krrish Dholakia | b4392d2b67 | test(test_amazing_vertex_completion.py): fix test for json schema validation in openai schema | 2024-08-07 18:39:20 -07:00 |
| Ishaan Jaff | b571f3e115 | run that ci/cd again | 2024-08-07 18:25:52 -07:00 |
| Krrish Dholakia | 4e02425d8b | fix(vertex_ai_partner.py): pass model for llama3 param mapping | 2024-08-07 18:16:41 -07:00 |
| Ishaan Jaff | 75bfe81748 | fix all optional param tests | 2024-08-07 18:16:41 -07:00 |
| Ishaan Jaff | 472a1df76f | fix test for wildcard routing | 2024-08-07 18:16:41 -07:00 |
| Ishaan Jaff | 4c375d13d6 | fix test_drop_params_parallel_tool_calls | 2024-08-07 18:16:41 -07:00 |
| Krrish Dholakia | a15317a377 | fix(vertex_ai_partner.py): pass model for llama3 param mapping | 2024-08-07 18:07:14 -07:00 |
| Ishaan Jaff | 4e90a25417 | fix all optional param tests | 2024-08-07 17:52:40 -07:00 |
| Krrish Dholakia | 400653992c | feat(router.py): allow using .acompletion() for request prioritization; allows /chat/completion endpoint to work for request prioritization calls | 2024-08-07 16:43:12 -07:00 |
| Ishaan Jaff | 6b03998e32 | fix test for wildcard routing | 2024-08-07 16:41:00 -07:00 |
| Ishaan Jaff | 341e098c32 | fix test_drop_params_parallel_tool_calls | 2024-08-07 16:26:56 -07:00 |
| Ishaan Jaff | e585dfba92 | docs prom | 2024-08-07 16:03:11 -07:00 |
| Krrish Dholakia | 639f84d609 | test(test_completion.py): handle gemini instability | 2024-08-07 15:58:47 -07:00 |
| Ishaan Jaff | 89050dc4f4 | ci/cd run again | 2024-08-07 15:44:54 -07:00 |
| Ishaan Jaff | 2c3e068435 | fix test_team_update_redis | 2024-08-07 15:37:02 -07:00 |
| Krrish Dholakia | f76261af35 | refactor(user_api_key_auth.py): refactor to replace user_id_information list with pydantic user_obj; Allows using the 'get_user_object' function in user_api_key_auth, keeping it consistent across jwt-auth and key-auth | 2024-08-07 15:33:55 -07:00 |