Commit graph

3565 commits

Author SHA1 Message Date
Krrish Dholakia
83c5b48842 fix(rds_iam_token.py): fix boto3 client init for rds 2024-08-28 12:15:57 -07:00
Krrish Dholakia
65097d595c fix(proxy/utils.py): fix model dump to exclude none values 2024-08-28 12:02:44 -07:00
Krrish Dholakia
76c38c6d4b build(model_prices_and_context_window.json): add bedrock mistral small 2024-08-28 10:12:16 -07:00
Ishaan Jaff
c27640e6e4 add /rerank test 2024-08-27 17:50:37 -07:00
Ishaan Jaff
5f2f7aa754 feat - add rerank on proxy 2024-08-27 17:36:40 -07:00
Ishaan Jaff
06529f19df
Merge pull request #5392 from BerriAI/litellm_add_native_cohere_rerank
[Feat] Add cohere rerank and together ai rerank
2024-08-27 17:29:37 -07:00
Ishaan Jaff
fb5be57bb8 v0 add rerank on litellm proxy 2024-08-27 17:28:39 -07:00
Krrish Dholakia
bd3057e495 test(test_proxy_exception_mapping): loosen assert 2024-08-27 16:14:30 -07:00
Krrish Dholakia
6431af0678 fix(bedrock_httpx.py): support 'Auth' header as extra_header
Fixes https://github.com/BerriAI/litellm/issues/5389#issuecomment-2313677977
2024-08-27 16:08:54 -07:00
Krrish Dholakia
2cf149fbad perf(sagemaker.py): asyncify hf prompt template check
leads to 189% improvement in RPS @ 100 users
2024-08-27 07:37:06 -07:00
Ishaan Jaff
a44596a067 fix created_at and updated_at not existing error 2024-08-26 21:04:39 -07:00
Ishaan Jaff
95455c8849 fix entrypoint 2024-08-26 20:32:23 -07:00
Ishaan Jaff
f3b3f39eb5 ui new build 2024-08-26 19:01:35 -07:00
Ishaan Jaff
75cbbea071 enforce regenerating keys in enterprise tier 2024-08-26 18:54:50 -07:00
Ishaan Jaff
4dc2eea58d update key name when regenerating a key 2024-08-26 18:40:51 -07:00
Ishaan Jaff
2615edc468 allow using hashed api keys on regen key 2024-08-26 18:15:52 -07:00
Ishaan Jaff
7230ee1f55 add regenerate_key_fn 2024-08-26 17:59:44 -07:00
Ishaan Jaff
5745f3d6cc fix schema 2024-08-26 17:27:06 -07:00
Ishaan Jaff
cbef0c0a0d add key_state created at to token 2024-08-26 16:52:33 -07:00
Ishaan Jaff
fb150f7ce5 update schema 2024-08-26 16:52:19 -07:00
Krrish Dholakia
64952ab044 fix: fix tests 2024-08-24 19:32:22 -07:00
Krrish Dholakia
5019e0322f fix(utils.py): fix linting errors 2024-08-24 17:51:59 -07:00
Ishaan Jaff
d9769c393e ui new build 2024-08-24 16:45:53 -07:00
Ishaan Jaff
20840eaad3 fix linting errors when adding a new team member 2024-08-24 16:38:43 -07:00
Ishaan Jaff
11187920ec
Merge pull request #5352 from BerriAI/litellm_allow_setting_caching_mode
[Feat-Caching] allow setting caching mode to default off
2024-08-24 16:27:45 -07:00
Ishaan Jaff
74f0e60962 fix set Caching Default Off 2024-08-24 09:43:39 -07:00
Krrish Dholakia
8782ee444d feat(team_endpoints.py): expose 2 new fields - updated_users and updated_team_memberships, on /team/member_add
Fixes https://github.com/BerriAI/litellm/issues/5345#issuecomment-2307863265

Helps user see the stored budget per team member
2024-08-24 09:22:16 -07:00
Krrish Dholakia
ac9a1e65ab fix(proxy_server.py): fix post /v1/batches endpoint
Fixes https://github.com/BerriAI/litellm/issues/5279#issuecomment-2307919820
2024-08-23 20:38:00 -07:00
Ishaan Jaff
2116046b91
Merge pull request #5346 from BerriAI/litellm_Add_vertex_text_to_speech
[Feat-LiteLLM] Add Vertex AI - Text to speech support
2024-08-23 18:29:53 -07:00
Ishaan Jaff
8fada93fff docs on using vertex tts 2024-08-23 17:57:49 -07:00
Krrish Dholakia
ab28e55b76 fix(proxy_server.py): support env vars for controlling global max parallel request retry/timeouts
fixes issue where litellm module level settings weren't working for global retries, due to time of init
2024-08-23 16:06:08 -07:00
Ishaan Jaff
92e5cd113d
Merge pull request #5342 from BerriAI/litellm_add_custom_guardrails
[Feat-Proxy] Add Custom Guardrails
2024-08-23 12:11:31 -07:00
Ishaan Jaff
918e4fcfe5 feat add test for custom guardrails 2024-08-23 12:01:43 -07:00
Ishaan Jaff
1b1e0f2d77 init custom guardrail class 2024-08-23 10:54:42 -07:00
Ishaan Jaff
7d30188f84 custom_callbacks 2024-08-23 09:52:52 -07:00
Ishaan Jaff
a99258440c fix use guardrail for pre call hook 2024-08-23 09:34:08 -07:00
Krrish Dholakia
93ed8c7216 fix(utils.py): handle additionalProperties is False for vertex ai / gemini calls
Fixes https://github.com/BerriAI/litellm/issues/5338

Also adds together ai json mode support
2024-08-23 09:21:32 -07:00
Ishaan Jaff
af92cff44d add custom guardrail reference 2024-08-23 08:32:07 -07:00
Ishaan Jaff
e62d0c7922 add the ability to init a custom guardrail 2024-08-23 08:21:07 -07:00
Krrish Dholakia
afb00a27cb fix(litellm_pre_call_utils.py): don't override k-v pair sent in spend_logs_metadata by user 2024-08-23 07:10:18 -07:00
Krish Dholakia
76b3db334b
Merge branch 'main' into litellm_azure_batch_apis 2024-08-22 19:07:54 -07:00
Ishaan Jaff
228252b92d
Merge branch 'main' into litellm_allow_using_azure_ad_token_auth 2024-08-22 18:21:24 -07:00
Ishaan Jaff
70f9e41ed9
Merge branch 'main' into litellm_add_bedrock_guardrails 2024-08-22 17:28:49 -07:00
Ishaan Jaff
1f0cc72531 test bedrock guardrails 2024-08-22 17:24:42 -07:00
Krrish Dholakia
735fc804ed fix(proxy_server.py): expose flag to disable retries when max parallel request limit is hit 2024-08-22 16:49:52 -07:00
Krrish Dholakia
7398e94472 fix(files_endpoints.py): fix multiple args error 2024-08-22 16:42:44 -07:00
Krrish Dholakia
d7b525f391 feat(auth_checks.py): allow team to call all models, when explicitly set via /* 2024-08-22 16:38:56 -07:00
Ishaan Jaff
9e3d573bcb add async_post_call_success_hook 2024-08-22 16:34:43 -07:00
Krish Dholakia
3cf009efbf
Merge pull request #5325 from BerriAI/litellm_redis_cluster
feat(caching.py): redis cluster support
2024-08-22 16:13:45 -07:00
Krrish Dholakia
d7d3eee349 feat(azure.py): support health checking azure deployments
Fixes https://github.com/BerriAI/litellm/issues/5279
2024-08-22 16:11:14 -07:00