Commit graph

16952 commits

Author SHA1 Message Date
Ishaan Jaff
d40695b979 docs custom guardrails 2024-08-23 09:50:19 -07:00
Ishaan Jaff
d10430c881 doc custom guardrail 2024-08-23 09:41:54 -07:00
Ishaan Jaff
a99258440c fix use guardrail for pre call hook 2024-08-23 09:34:08 -07:00
Krrish Dholakia
93ed8c7216 fix(utils.py): handle additionalProperties is False for vertex ai / gemini calls
Fixes https://github.com/BerriAI/litellm/issues/5338

Also adds together ai json mode support
2024-08-23 09:21:32 -07:00
Ishaan Jaff
af92cff44d add custom guardrail reference 2024-08-23 08:32:07 -07:00
Ishaan Jaff
e62d0c7922 add the ability to init a custom guardrail 2024-08-23 08:21:07 -07:00
Krrish Dholakia
874d58fe8a fix(factory.py): support 'add_generation_prompt' field for hf chat templates
Fixes https://github.com/BerriAI/litellm/pull/5178#issuecomment-2306362008
2024-08-23 08:06:21 -07:00
Krrish Dholakia
afb00a27cb fix(litellm_pre_call_utils.py): don't override k-v pair sent in spend_logs_metadata by user 2024-08-23 07:10:18 -07:00
Ishaan Jaff
4ac78a0765 fix prom latency metrics 2024-08-23 06:59:19 -07:00
Krrish Dholakia
52fb1e0c38 docs(caching.md): add redis cluster support to docs 2024-08-22 22:16:07 -07:00
Krrish Dholakia
cee69a92ce docs(bedrock.md): add docs on alternating user/assistant messages 2024-08-22 22:10:10 -07:00
Ishaan Jaff
1765976ce0 use with base64 2024-08-22 20:49:39 -07:00
Ishaan Jaff
ed2d3f4bf8 allow load testing sagemaker url 2024-08-22 20:03:27 -07:00
Krrish Dholakia
3f96a8c667 bump: version 1.44.3 → 1.44.4 2024-08-22 19:31:35 -07:00
Krish Dholakia
0a5fc42f5c Merge pull request #5337 from BerriAI/litellm_azure_batch_apis
feat(batches): add azure openai batches endpoint support
2024-08-22 19:08:00 -07:00
Krish Dholakia
76b3db334b Merge branch 'main' into litellm_azure_batch_apis 2024-08-22 19:07:54 -07:00
Ishaan Jaff
b054dd0e45 docs fix 2024-08-22 19:04:14 -07:00
Ishaan Jaff
25609a94ad docs moderation 2024-08-22 18:57:54 -07:00
Krrish Dholakia
56cb94ac5e docs(batches.md): add more examples to docs 2024-08-22 18:51:44 -07:00
Ishaan Jaff
b865993b34 docs move pass thru endpoints 2024-08-22 18:49:26 -07:00
Ishaan Jaff
18cd905a1f ci/cd run again 2024-08-22 18:41:10 -07:00
Ishaan Jaff
95beba870e bump: version 1.44.2 → 1.44.3 2024-08-22 18:27:38 -07:00
Krrish Dholakia
11c7e92b58 docs(sidebars.js): refactor docs 2024-08-22 18:22:50 -07:00
Ishaan Jaff
645da5b437 Merge pull request #5332 from BerriAI/litellm_allow_using_azure_ad_token_auth
[Feat] Azure OpenAI add support for using azure_ad_token_provider with LiteLLM Proxy + Router
2024-08-22 18:21:54 -07:00
Ishaan Jaff
228252b92d Merge branch 'main' into litellm_allow_using_azure_ad_token_auth 2024-08-22 18:21:24 -07:00
Ishaan Jaff
2bf1fb3275 add test for test_azure_tenant_id_auth 2024-08-22 18:09:00 -07:00
Ishaan Jaff
2d57fab79f Merge pull request #5339 from BerriAI/litellm_add_bedrock_guardrails
[Feat-Proxy] add bedrock guardrails support
2024-08-22 17:28:57 -07:00
Ishaan Jaff
70f9e41ed9 Merge branch 'main' into litellm_add_bedrock_guardrails 2024-08-22 17:28:49 -07:00
Ishaan Jaff
1f0cc72531 test bedrock guardrails 2024-08-22 17:24:42 -07:00
Krrish Dholakia
e445b78490 docs(configs.md): add global_max_parallel_requests to docs 2024-08-22 17:12:52 -07:00
Krrish Dholakia
735fc804ed fix(proxy_server.py): expose flag to disable retries when max parallel request limit is hit 2024-08-22 16:49:52 -07:00
Krrish Dholakia
7398e94472 fix(files_endpoints.py): fix multiple args error 2024-08-22 16:42:44 -07:00
Krrish Dholakia
d7b525f391 feat(auth_checks.py): allow team to call all models, when explicitly set via /* 2024-08-22 16:38:56 -07:00
Ishaan Jaff
9e3d573bcb add async_post_call_success_hook 2024-08-22 16:34:43 -07:00
Ishaan Jaff
499b6b3368 doc bedrock guardrails 2024-08-22 16:25:22 -07:00
Ishaan Jaff
f13a65be2d fix azure_ad_token_provider 2024-08-22 16:15:53 -07:00
Krish Dholakia
3cf009efbf Merge pull request #5325 from BerriAI/litellm_redis_cluster
feat(caching.py): redis cluster support
2024-08-22 16:13:45 -07:00
Krrish Dholakia
d7d3eee349 feat(azure.py): support health checking azure deployments
Fixes https://github.com/BerriAI/litellm/issues/5279
2024-08-22 16:11:14 -07:00
Ishaan Jaff
7d55047ab9 add bedrock guardrails support 2024-08-22 16:09:55 -07:00
Krrish Dholakia
63cd94c32a fix: fix linting errors 2024-08-22 15:51:59 -07:00
Krish Dholakia
5b7fbc44be Merge pull request #5336 from micpst/docs-dbally
docs(projects): add dbally to sidebar
2024-08-22 15:49:16 -07:00
Ishaan Jaff
14a6ce367d add types for BedrockMessage 2024-08-22 15:40:58 -07:00
Krrish Dholakia
8625663458 feat(proxy_server.py): support azure batch api endpoints 2024-08-22 15:21:43 -07:00
Krrish Dholakia
ada426d652 docs(batches.md): add docs on calling azure batches api 2024-08-22 14:51:14 -07:00
Krrish Dholakia
80675b22bd feat(batches): add azure openai batches endpoint support
Closes https://github.com/BerriAI/litellm/issues/5073
2024-08-22 14:46:51 -07:00
Michał Pstrąg
a37f004c1d Merge branch 'main' into docs-dbally 2024-08-22 23:25:57 +02:00
Michał Pstrąg
62df7c755b add dbally project 2024-08-22 23:21:40 +02:00
Ishaan Jaff
8dbcdafe4b Merge pull request #5335 from BerriAI/litellm_add_metrics_latency
[Feat-Proxy] Prometheus Metrics to Track request latency, track llm api latency
2024-08-22 14:10:19 -07:00
Ishaan Jaff
57707b04b6 add prom docs for Request Latency Metrics 2024-08-22 14:06:14 -07:00
Ishaan Jaff
36b550b8db update promtheus metric names 2024-08-22 14:03:00 -07:00