Commit graph

2778 commits

Author SHA1 Message Date
Ishaan Jaff
12343650e4 feat - allow looking up model_id in model info 2024-07-04 13:13:42 -07:00
Krrish Dholakia
a3e9bc4ae2 fix(aws_secret_manager.py): fix litellm license check 2024-07-03 22:07:48 -07:00
Krish Dholakia
5e47970eed
Merge branch 'main' into litellm_anthropic_tool_calling_streaming_fix 2024-07-03 20:43:51 -07:00
Krrish Dholakia
2e5a81f280 fix(utils.py): stream_options working across all providers 2024-07-03 20:40:46 -07:00
Krrish Dholakia
8dbe0559dd fix(utils.py): stream_options fix 2024-07-03 19:57:34 -07:00
Ishaan Jaff
046b9d8f66 fix lakera ai testing 2024-07-03 18:58:36 -07:00
Ishaan Jaff
1028be6308 feat- control lakera ai per llm call 2024-07-03 16:34:23 -07:00
Ishaan Jaff
80dd14d59e test - default on/off guardrails 2024-07-03 14:50:13 -07:00
Ishaan Jaff
b1e6cee000
Merge branch 'main' into litellm_allow_setting_guardrails_config 2024-07-03 14:21:51 -07:00
Ishaan Jaff
129c2e0c4f init guardrails on proxy 2024-07-03 14:18:12 -07:00
Ishaan Jaff
50443d3d48 fix checks on litellm license 2024-07-03 14:02:07 -07:00
Krrish Dholakia
c6962bda80 fix(user_api_key_auth.py): expose /health as an info route
this still requires key access. Allows users to check model health
2024-07-03 10:02:50 -07:00
Krrish Dholakia
606d04b05b fix(_service_logging.py): only trigger otel if in service_callback
Fixes https://github.com/BerriAI/litellm/issues/4511
2024-07-03 09:48:38 -07:00
Krish Dholakia
21d3a28e51
Merge branch 'main' into litellm_support_dynamic_rpm_limiting 2024-07-02 17:51:18 -07:00
Ishaan Jaff
174b2b69df
Merge pull request #4518 from BerriAI/litellm_fix_background_health_checks
[Fix-Proxy] Background health checks use deep copy of model list for _run_background_health_check
2024-07-02 16:42:34 -07:00
Krrish Dholakia
66c6992f8a fix(slack_alerting.py): use in-memory cache for checking request status 2024-07-02 13:01:59 -07:00
Ishaan Jaff
cd6b121642 use deep copy of router for _run_background_health_check 2024-07-02 11:29:24 -07:00
Krrish Dholakia
5aae2313f3 fix(aws_secret_manager.py): fix string replace 2024-07-02 00:42:12 -07:00
Krrish Dholakia
196b94455e fix(dynamic_rate_limiter.py): add rpm allocation, priority + quota reservation to docs 2024-07-01 23:35:42 -07:00
Krish Dholakia
011e14eb08
Merge branch 'main' into litellm_disable_cooldowns 2024-07-01 23:10:10 -07:00
Krrish Dholakia
6b529d4e0e fix(dynamic_rate_limiter.py): support setting priority + reserving tpm/rpm 2024-07-01 23:08:54 -07:00
Ishaan Jaff
1c194f0275
Merge pull request #4501 from BerriAI/litellm_return_Response_headers
[Feat] Return Response headers for OpenAI / Azure OpenAI when `litellm.return_response_headers=True`
2024-07-01 21:11:10 -07:00
Krrish Dholakia
0781014706 test(test_dynamic_rate_limit_handler.py): refactor tests for rpm suppprt 2024-07-01 20:16:10 -07:00
Krrish Dholakia
f23b17091d fix(dynamic_rate_limiter.py): support dynamic rate limiting on rpm 2024-07-01 17:45:10 -07:00
Krrish Dholakia
b761ce9dd1 fix(s3.py): fix s3 path logging on proxy 2024-07-01 17:22:31 -07:00
Ishaan Jaff
48946f7528 fix config 2024-07-01 17:02:15 -07:00
Krrish Dholakia
ea74e01813 fix(router.py): disable cooldowns
allow admin to disable model cooldowns
2024-07-01 15:03:10 -07:00
Krrish Dholakia
bfab76a810 docs(configs.md): add wildcard model name to docs 2024-07-01 14:11:45 -07:00
Krrish Dholakia
223494a0e0 fix(aws_secret_manager.py): accept 'aws_kms' being in the key name 2024-07-01 13:46:20 -07:00
Ishaan Jaff
4910029eeb
Merge pull request #4479 from BerriAI/litellm_fix_admin_ui_login_issue
[Fix] Admin UI - fix error users we're seeing when logging in (use correct  user_id  when creating key for admin ui)
2024-06-29 20:00:45 -07:00
Ishaan Jaff
9eee36b449 add better debugging on /spend/report 2024-06-29 18:01:25 -07:00
Ishaan Jaff
22ad5b407e
Merge pull request #4474 from BerriAI/litellm_store_full_exception_inDB
[Fix] Proxy ErrorLogs store raw exception in error log
2024-06-29 17:46:24 -07:00
Ishaan Jaff
37352ba692 fix - use correct user_id when creating key for admin ui 2024-06-29 14:36:58 -07:00
Ishaan Jaff
c7e89a571e fix error logs store exception in DB 2024-06-29 12:15:14 -07:00
Ishaan Jaff
3a5d258e05 raise error on /spend/report endpoint 2024-06-29 12:00:19 -07:00
Ishaan Jaff
b09c283cc4 feat - add spend report grouped by api key 2024-06-29 11:33:02 -07:00
Krrish Dholakia
4f32f283a3 fix(vertex_httpx.py): fix streaming for cloudflare proxy calls 2024-06-29 09:09:56 -07:00
Ishaan Jaff
2a7592d026 doc - pass through auth 2024-06-29 09:09:18 -07:00
Ishaan Jaff
c578816434 feat - setting up auth on pass through endpoint 2024-06-29 08:38:44 -07:00
Krrish Dholakia
9556bfda81 fix(aws_secret_manager.py): fix typing error 2024-06-28 21:56:14 -07:00
Krish Dholakia
d0c89ddbe3
Merge pull request #4437 from BerriAI/litellm_aws_kms_decryption
fix(initial-commit): decrypts aws keys in entrypoint.sh
2024-06-28 21:10:52 -07:00
Ishaan Jaff
bc1c96ca35 pass through langfuse "/api/public/ingestion" 2024-06-28 17:29:11 -07:00
Ishaan Jaff
40d9278dcb test - pass through langfuse requests 2024-06-28 17:28:21 -07:00
Ishaan Jaff
6af1293384 feat - pass through langfuse requests 2024-06-28 17:27:13 -07:00
Ishaan Jaff
d0d1efcbdf
Merge pull request #4463 from BerriAI/docs_create_pass_through_routes_litellm_proxy
Docs create pass through routes litellm proxy (tutorial setup cohere Re-Rank Endpoint)
2024-06-28 16:55:44 -07:00
Krrish Dholakia
b78dd6416a fix(prisma_migration.py): support decrypting variables in a python script 2024-06-28 16:31:37 -07:00
Krrish Dholakia
b84d335624 fix(proxy_cli.py): run aws kms decrypt before starting proxy server 2024-06-28 16:03:56 -07:00
Ishaan Jaff
8f2931937a fix use os.environ/ vars for pass through endpoints 2024-06-28 15:30:31 -07:00
Ishaan Jaff
954c6ec9ed fix support pass through endpoints 2024-06-28 15:06:51 -07:00
Ishaan Jaff
f52cc18adb feat - support pass through endpoints 2024-06-28 15:03:21 -07:00