456 commits

Author SHA1 Message Date
Marc Abramowitz
642f1a7bcc Check existence of multiple views in 1 query
instead of multiple queries. This is more efficient because it lets us check for
all views in one query instead of multiple queries.
2024-07-23 21:55:02 -07:00
Ishaan Jaff
b137207ae6 doc alert_to_webhook_url 2024-07-23 11:04:15 -07:00
Ishaan Jaff
c1593c0cd1 update alert_to_webhook_url 2024-07-23 10:07:08 -07:00
Krish Dholakia
c4db6aa15e Merge pull request #4810 from BerriAI/litellm_team_modify_guardrails
feat(auth_checks.py): Allow admin to disable team from turning on/off guardrails
2024-07-22 22:32:24 -07:00
Krrish Dholakia
1a83935aa4 fix(proxy/utils.py): add stronger typing for litellm params in failure call logging 2024-07-22 21:31:39 -07:00
Ishaan Jaff
c9e2f977dd feat - return team_metadata in user_api_key_auth 2024-07-22 18:21:50 -07:00
Krrish Dholakia
8b3c8102a7 feat(auth_checks.py): Allow admin to disable team from turning on/off guardrails. 2024-07-20 18:39:05 -07:00
Krrish Dholakia
99aa311083 fix(user_api_key_auth.py): update team values in token cache if refreshed more recently 2024-07-19 17:35:59 -07:00
Ishaan Jaff
df1ac92222 fix add fix to update spend logs 2024-07-19 12:49:23 -07:00
Ishaan Jaff
3736152e7d fix calculate correct alerting threshold 2024-07-16 21:43:17 -07:00
Ishaan Jaff
6c918f2373 fix tracking hanging requests 2024-07-16 21:43:16 -07:00
Ishaan Jaff
36be9967d1 fix storing request status in mem 2024-07-16 21:43:16 -07:00
Krrish Dholakia
ec03e675c9 fix(proxy/utils.py): fix failure logging for rejected requests. + unit tests 2024-07-16 17:15:20 -07:00
Ishaan Jaff
69f74c1e6c fix only use per request tracking if slack alerting is being used 2024-07-13 18:01:53 -07:00
Krrish Dholakia
0cc273d77b feat(pass_through_endpoint.py): support enforcing key rpm limits on pass through endpoints
Closes https://github.com/BerriAI/litellm/issues/4698
2024-07-13 13:29:44 -07:00
Ishaan Jaff
92bf98b30f Merge pull request #4684 from BerriAI/litellm_safe_memory_mode
[Feat] Allow safe memory mode
2024-07-12 18:32:16 -07:00
Ishaan Jaff
08efef5316 feat add safe_memory_mode 2024-07-12 18:18:39 -07:00
Ishaan Jaff
8c8dcdbdb1 reduce ttl for update_request_status 2024-07-12 15:14:54 -07:00
Ishaan Jaff
3a06e2e425 fix show exact prisma exception when starting proxy 2024-07-09 18:20:09 -07:00
Krrish Dholakia
20e39d6acc fix(utils.py): cleanup 'additionalProperties=False' for tool calling with zod
Fixes issue with zod passing in additionalProperties=False, causing vertex ai / gemini calls to fail
2024-07-06 17:27:37 -07:00
Ishaan Jaff
d61cc598b0 Merge pull request #4576 from BerriAI/litellm_encrypt_decrypt_using_salt
[Refactor] Use helper function to encrypt/decrypt model credentials
2024-07-06 15:11:09 -07:00
Ishaan Jaff
7f6f85ec3c improve sign up flow - show missing env vars 2024-07-06 13:57:19 -07:00
Krrish Dholakia
71edb28d6a fix(proxy_server.py): fix embedding model exception mapping 2024-07-06 11:14:41 -07:00
Ishaan Jaff
8f0b55879f move encrypt / decrypt to helper 2024-07-06 11:09:47 -07:00
Krrish Dholakia
6090fd37fa fix(test_proxy_reject_logging.py): fix test 2024-07-05 19:09:37 -07:00
Krrish Dholakia
fd11f63f71 fix(utils.py): log failure to sync failure callbacks as well 2024-07-05 14:49:34 -07:00
Krrish Dholakia
b1b21b0340 fix(proxy/utils.py): support logging rejected requests to langfuse, etc. 2024-07-05 14:39:35 -07:00
Krrish Dholakia
d528b66db0 fix(proxy_server.py): fix callback check order 2024-07-05 14:06:33 -07:00
Krrish Dholakia
7618ec43b3 fix(proxy_server.py): support langfuse logging for rejected requests on /v1/chat/completions 2024-07-05 13:07:09 -07:00
Krrish Dholakia
66c6992f8a fix(slack_alerting.py): use in-memory cache for checking request status 2024-07-02 13:01:59 -07:00
Krish Dholakia
961e7ac95d Merge branch 'main' into litellm_dynamic_tpm_limits 2024-06-22 19:14:59 -07:00
Krrish Dholakia
a31a05d45d feat(dynamic_rate_limiter.py): working e2e 2024-06-22 14:41:22 -07:00
Krrish Dholakia
532f24bfb7 refactor: instrument 'dynamic_rate_limiting' callback on proxy 2024-06-22 00:32:29 -07:00
Krish Dholakia
f192b76ed4 Merge pull request #4344 from BerriAI/litellm_refactor_langfuse_slack_trace_url
refactor(litellm_logging.py): refactors how slack_alerting generates langfuse trace url
2024-06-21 23:37:38 -07:00
Ishaan Jaff
29c2155796 fix cost tracking by tags 2024-06-21 16:49:57 -07:00
Krrish Dholakia
2584120012 refactor(litellm_logging.py): refactors how slack_alerting generates langfuse trace url
gets the url from logging object
2024-06-21 16:12:25 -07:00
Krrish Dholakia
941574a921 fix(proxy/utils.py): fix add langfuse trace id to alert
Fixing the import after refactor
2024-06-21 14:55:09 -07:00
Krrish Dholakia
5912decd19 fix(proxy/utils.py): fix bool on check 2024-06-21 14:29:38 -07:00
Krrish Dholakia
df3afa652f fix(proxy/utils.py): fix linting error 2024-06-20 14:13:38 -07:00
Krrish Dholakia
2623bb260f fix(litellm_logging.py): fix lago callback logic 2024-06-17 09:10:19 -07:00
Krish Dholakia
f2b3770869 Merge pull request #4216 from BerriAI/litellm_refactor_logging
refactor(utils.py): Cut down utils.py to <10k lines.
2024-06-15 15:19:42 -07:00
Krrish Dholakia
af88e55d46 fix(utils.py): move 'set_callbacks' to litellm_logging.py 2024-06-15 12:02:30 -07:00
Krrish Dholakia
4f91205530 refactor(utils.py): refactor Logging to its own class. Cut down utils.py to <10k lines.
Easier debugging
Reference: https://github.com/BerriAI/litellm/issues/4206
2024-06-15 10:57:20 -07:00
Ishaan Jaff
da3ae00bd6 fix - proxy refactor user_api_key_auth 2024-06-15 10:33:58 -07:00
Ishaan Jaff
4bfd46d0a6 Merge pull request #4209 from BerriAI/litellm_send_email_alerts_budget_exceeded
[Feat] send email alerts when budget exceeded
2024-06-14 20:23:19 -07:00
Ishaan Jaff
52b54da354 fix - better debugging before sending emails 2024-06-14 17:38:33 -07:00
Krrish Dholakia
e18002023a fix(slack_alerting.py): allow new 'alerting_metadata' arg
Allows user to pass in additional alerting metadata for debugging
2024-06-14 16:06:47 -07:00
Krrish Dholakia
d8acda9f39 fix(proxy/utils.py): fix reset monthly budget
fix to reset at the same time each month (not at start of month)
2024-06-14 14:41:06 -07:00
Krish Dholakia
e61cd2e1e2 Merge branch 'main' into litellm_redis_cache_usage 2024-06-13 22:07:21 -07:00
Krrish Dholakia
f65752c18b feat(proxy/utils.py): allow budget duration in months
Closes https://github.com/BerriAI/litellm/issues/4042
2024-06-13 16:52:17 -07:00