Commit graph

424 commits

Author SHA1 Message Date
Ishaan Jaff
b6009233ac fix always read redis 2024-09-02 21:08:32 -07:00
Krrish Dholakia
74081d31f6 fix(proxy/utils.py): fix model dump to exclude none values 2024-08-28 12:02:44 -07:00
Ishaan Jaff
359a003ac8 v0 add rerank on litellm proxy 2024-08-27 17:28:39 -07:00
Ishaan Jaff
a8e192a868 fix use guardrail for pre call hook 2024-08-23 09:34:08 -07:00
Ishaan Jaff
6e3f27cf69 add custom guardrail reference 2024-08-23 08:32:07 -07:00
Krrish Dholakia
64affd0d6b feat(user_api_key_auth.py): allow team admin to add new members to team 2024-08-20 14:01:12 -07:00
Ishaan Jaff
249df0a78e run during_call_hook 2024-08-19 12:07:46 -07:00
Ishaan Jaff
b4bca8db82 feat - allow accessing data post success call 2024-08-19 11:35:33 -07:00
Ishaan Jaff
0fc717fd0a add fallback_reports as slack alert 2024-08-10 15:26:32 -07:00
Ishaan Jaff
4c4cc1ba23 Merge pull request #5058 from BerriAI/litellm_add_debug_statements_connecting_prisma
Fix - add debug statements when connecting to prisma DB
2024-08-05 20:53:55 -07:00
Ishaan Jaff
5dc04d8479 Merge pull request #5059 from BerriAI/litelm_log_otel_args
OTEL - Log DB queries / functions on OTEL
2024-08-05 20:51:11 -07:00
Ishaan Jaff
bf72b7e567 otel fix async_service_failure_hook 2024-08-05 20:18:40 -07:00
Ishaan Jaff
b80629b828 log event_metadata on otel service loggers 2024-08-05 19:58:55 -07:00
Krrish Dholakia
4ba576724c test: improve debugging for test 2024-08-05 19:41:08 -07:00
Ishaan Jaff
6fd1685106 add debug statements when connected to prisma db 2024-08-05 19:13:46 -07:00
Ishaan Jaff
aade38760d use common helpers for writing to otel 2024-07-27 11:40:39 -07:00
Ishaan Jaff
bb7fc3e426 use _get_parent_otel_span_from_kwargs 2024-07-27 11:14:06 -07:00
Krrish Dholakia
487035c970 feat(auth_check.py): support using redis cache for team objects
Allows team update / check logic to work across instances instantly
2024-07-24 18:14:49 -07:00
Marc Abramowitz
1a15755cc3 Check existence of multiple views in 1 query
instead of multiple queries. This is more efficient because it lets us check for
all views in one query instead of multiple queries.
2024-07-23 21:55:02 -07:00
Ishaan Jaff
abc6da3775 doc alert_to_webhook_url 2024-07-23 11:04:15 -07:00
Ishaan Jaff
f2cbdf2336 update alert_to_webhook_url 2024-07-23 10:07:08 -07:00
Krish Dholakia
0cb872cbea Merge pull request #4810 from BerriAI/litellm_team_modify_guardrails
feat(auth_checks.py): Allow admin to disable team from turning on/off guardrails
2024-07-22 22:32:24 -07:00
Krrish Dholakia
14c5d6ff03 fix(proxy/utils.py): add stronger typing for litellm params in failure call logging 2024-07-22 21:31:39 -07:00
Ishaan Jaff
cd2bdfd2fb feat - return team_metadata in user_api_key_auth 2024-07-22 18:21:50 -07:00
Krrish Dholakia
a351b7cc3e feat(auth_checks.py): Allow admin to disable team from turning on/off guardrails. 2024-07-20 18:39:05 -07:00
Krrish Dholakia
b92af48854 fix(user_api_key_auth.py): update team values in token cache if refreshed more recently 2024-07-19 17:35:59 -07:00
Ishaan Jaff
75e48c84f4 fix add fix to update spend logs 2024-07-19 12:49:23 -07:00
Ishaan Jaff
3203d825e0 fix calculate correct alerting threshold 2024-07-16 21:43:17 -07:00
Ishaan Jaff
f1a3fd99b7 fix tracking hanging requests 2024-07-16 21:43:16 -07:00
Ishaan Jaff
4eef814a35 fix storing request status in mem 2024-07-16 21:43:16 -07:00
Krrish Dholakia
b022099712 fix(proxy/utils.py): fix failure logging for rejected requests. + unit tests 2024-07-16 17:15:20 -07:00
Ishaan Jaff
36b24209eb fix only use per request tracking if slack alerting is being used 2024-07-13 18:01:53 -07:00
Krrish Dholakia
1d6643df22 feat(pass_through_endpoint.py): support enforcing key rpm limits on pass through endpoints
Closes https://github.com/BerriAI/litellm/issues/4698
2024-07-13 13:29:44 -07:00
Ishaan Jaff
1adff9cbd6 Merge pull request #4684 from BerriAI/litellm_safe_memory_mode
[Feat] Allow safe memory mode
2024-07-12 18:32:16 -07:00
Ishaan Jaff
c43948545f feat add safe_memory_mode 2024-07-12 18:18:39 -07:00
Ishaan Jaff
bc7b3f28b9 reduce ttil for update_request_status 2024-07-12 15:14:54 -07:00
Ishaan Jaff
49f8894dcc fix show exact prisma exception when starting proxy 2024-07-09 18:20:09 -07:00
Krrish Dholakia
1dae0a5b6a fix(utils.py): cleanup 'additionalProperties=False' for tool calling with zod
Fixes issue with zod passing in additionalProperties=False, causing vertex ai / gemini calls to fail
2024-07-06 17:27:37 -07:00
Ishaan Jaff
f96c0efd90 Merge pull request #4576 from BerriAI/litellm_encrypt_decrypt_using_salt
[Refactor] Use helper function to encrypt/decrypt model credentials
2024-07-06 15:11:09 -07:00
Ishaan Jaff
752fe3ac7c improve sign up flow - show missing env vars 2024-07-06 13:57:19 -07:00
Krrish Dholakia
47ce6ccac0 fix(proxy_server.py): fix embedding model exception mapping 2024-07-06 11:14:41 -07:00
Ishaan Jaff
561a30dd59 move encrypt / decrypt to helper 2024-07-06 11:09:47 -07:00
Krrish Dholakia
faf11c3a3e fix(test_proxy_reject_logging.py): fix test 2024-07-05 19:09:37 -07:00
Krrish Dholakia
fe889c47db fix(utils.py): log failure to sync failure callbacks as well 2024-07-05 14:49:34 -07:00
Krrish Dholakia
deb7a86e9c fix(proxy/utils.py): support logging rejected requests to langfuse, etc. 2024-07-05 14:39:35 -07:00
Krrish Dholakia
9f039a9776 fix(proxy_server.py): fix callback check order 2024-07-05 14:06:33 -07:00
Krrish Dholakia
56410cfcd0 fix(proxy_server.py): support langfuse logging for rejected requests on /v1/chat/completions 2024-07-05 13:07:09 -07:00
Krrish Dholakia
d09a78d7fd fix(slack_alerting.py): use in-memory cache for checking request status 2024-07-02 13:01:59 -07:00
Krish Dholakia
63d0defa6d Merge branch 'main' into litellm_dynamic_tpm_limits 2024-06-22 19:14:59 -07:00
Krrish Dholakia
8843b0dc77 feat(dynamic_rate_limiter.py): working e2e 2024-06-22 14:41:22 -07:00