Commit graph

3536 commits

Author SHA1 Message Date
Ishaan Jaff
39efd58b1e docs on using vertex tts 2024-08-23 17:57:49 -07:00
Krrish Dholakia
26bd9a409a fix(proxy_server.py): support env vars for controlling global max parallel request retry/timeouts
fixes issue where litellm module level settings weren't working for global retries, due to time of init
2024-08-23 16:06:08 -07:00
Ishaan Jaff
54b4fabf31 Merge pull request #5342 from BerriAI/litellm_add_custom_guardrails
[Feat-Proxy] Add Custom Guardrails
2024-08-23 12:11:31 -07:00
Ishaan Jaff
ba20c6a35c feat add test for custom guardrails 2024-08-23 12:01:43 -07:00
Ishaan Jaff
dcd39dac00 init custom guardrail class 2024-08-23 10:54:42 -07:00
Ishaan Jaff
7e064f2dcd custom_callbacks 2024-08-23 09:52:52 -07:00
Ishaan Jaff
a8e192a868 fix use guardrail for pre call hook 2024-08-23 09:34:08 -07:00
Krrish Dholakia
2a6aa6da7a fix(utils.py): handle additionalProperties is False for vertex ai / gemini calls
Fixes https://github.com/BerriAI/litellm/issues/5338
Also adds together ai json mode support
2024-08-23 09:21:32 -07:00
Ishaan Jaff
6e3f27cf69 add custom guardrail reference 2024-08-23 08:32:07 -07:00
Ishaan Jaff
65e222b40f add the ability to init a custom guardrail 2024-08-23 08:21:07 -07:00
Krrish Dholakia
fa609c6d54 fix(litellm_pre_call_utils.py): don't override k-v pair sent in spend_logs_metadata by user 2024-08-23 07:10:18 -07:00
Krish Dholakia
11cbf60e4f Merge branch 'main' into litellm_azure_batch_apis 2024-08-22 19:07:54 -07:00
Ishaan Jaff
2864d16fa1 Merge branch 'main' into litellm_allow_using_azure_ad_token_auth 2024-08-22 18:21:24 -07:00
Ishaan Jaff
c23cf18a70 Merge branch 'main' into litellm_add_bedrock_guardrails 2024-08-22 17:28:49 -07:00
Ishaan Jaff
550da1153e test bedrock guardrails 2024-08-22 17:24:42 -07:00
Krrish Dholakia
b0706a6f8f fix(proxy_server.py): expose flag to disable retries when max parallel request limit is hit 2024-08-22 16:49:52 -07:00
Krrish Dholakia
b95d4bb61a fix(files_endpoints.py): fix multiple args error 2024-08-22 16:42:44 -07:00
Krrish Dholakia
73a5921262 feat(auth_checks.py): allow team to call all models, when explicitly set via /* 2024-08-22 16:38:56 -07:00
Ishaan Jaff
0a74738112 add async_post_call_success_hook 2024-08-22 16:34:43 -07:00
Krish Dholakia
e9928a01ad Merge pull request #5325 from BerriAI/litellm_redis_cluster
feat(caching.py): redis cluster support
2024-08-22 16:13:45 -07:00
Krrish Dholakia
b7f42c96a4 feat(azure.py): support health checking azure deployments
Fixes https://github.com/BerriAI/litellm/issues/5279
2024-08-22 16:11:14 -07:00
Ishaan Jaff
2f01a22ef7 add bedrock guardrails support 2024-08-22 16:09:55 -07:00
Krrish Dholakia
5959cf0b50 fix: fix linting errors 2024-08-22 15:51:59 -07:00
Krrish Dholakia
f9ab33cbc2 feat(proxy_server.py): support azure batch api endpoints 2024-08-22 15:21:43 -07:00
Ishaan Jaff
c719c375f7 track litellm_request_latency_metric 2024-08-22 13:58:10 -07:00
Ishaan Jaff
b16752f0bc add new litellm params for client_id, tenant_id etc 2024-08-22 11:37:30 -07:00
Krish Dholakia
f87f3987bd Merge branch 'main' into litellm_redis_cluster 2024-08-22 11:06:14 -07:00
Ishaan Jaff
525d152d85 use azure_ad_token_provider to init clients 2024-08-22 11:03:49 -07:00
Ishaan Jaff
d2dd40e1d2 fix allow setting LiteLLM license as .env 2024-08-22 10:05:00 -07:00
Ishaan Jaff
468bf7c615 fix allow setting license in config.yaml 2024-08-22 09:45:15 -07:00
Ishaan Jaff
a067f0f926 add docstring for /embeddings and /completions 2024-08-22 09:30:47 -07:00
Ishaan Jaff
2caee891e0 add doc string for /chat/completions swagger 2024-08-22 09:27:40 -07:00
Ishaan Jaff
0de3f615ca fix /user/delete doc string 2024-08-22 09:09:51 -07:00
Ishaan Jaff
b0b5400cde fix team_member_add 2024-08-21 19:10:37 -07:00
Ishaan Jaff
5ffee5875c fix test_master_key_hashing 2024-08-21 17:56:09 -07:00
Ishaan Jaff
2baa6eb5e1 use litellm proxy with vertex ai sdk 2024-08-21 17:47:01 -07:00
Ishaan Jaff
56057f278a Merge branch 'main' into litellm_pass_through_vtx_multi_modal 2024-08-21 17:23:22 -07:00
Ishaan Jaff
c20cb7b586 fix pass through endpoints 2024-08-21 17:21:22 -07:00
Ishaan Jaff
bfb5136489 refactor vertex endpoints to pass through all routes 2024-08-21 17:08:42 -07:00
Ishaan Jaff
376165aa52 add test vtx embedding 2024-08-21 17:05:47 -07:00
Ishaan Jaff
152ebe2e7f proxy - print embedding request when received 2024-08-21 17:00:18 -07:00
Krrish Dholakia
d6bc37374e feat(litellm_logging.py): add 'saved_cache_cost' to standard logging payload (s3) 2024-08-21 16:58:07 -07:00
Krrish Dholakia
0707d8d815 docs(users.md): add doc on setting max budget for internal users 2024-08-21 16:21:40 -07:00
Krish Dholakia
3971880af4 Merge branch 'main' into litellm_disable_storing_master_key_hash_in_db 2024-08-21 15:37:25 -07:00
Krrish Dholakia
33c9c16388 feat(caching.py): redis cluster support
Closes https://github.com/BerriAI/litellm/issues/4358
2024-08-21 15:01:52 -07:00
Krish Dholakia
a583b95d85 Merge pull request #5308 from BerriAI/litellm_team_admin_permissions
feat(user_api_key_auth.py): allow team admin to add new members to team
2024-08-21 14:21:22 -07:00
Krrish Dholakia
ac5c6c8751 fix(litellm_pre_call_utils.py): handle dynamic keys via api correctly 2024-08-21 13:37:21 -07:00
Ishaan Jaff
b522ade709 docs semantic caching qdrant 2024-08-21 13:03:41 -07:00
Krrish Dholakia
35a9f328c7 fix(internal_user_endpoints.py): pass in user api key dict value 2024-08-21 12:46:43 -07:00
Ishaan Jaff
8c83fb3f34 fixes for using qdrant with litellm proxy 2024-08-21 12:36:41 -07:00