2254 commits

Author SHA1 Message Date
Krish Dholakia
d25ed9c4d3
Merge pull request #3828 from BerriAI/litellm_outage_alerting
fix(slack_alerting.py): support region based outage alerting
2024-05-24 19:13:17 -07:00
Krrish Dholakia
8dec87425e feat(slack_alerting.py): refactor region outage alerting to do model based alerting instead
Unable to extract azure region from api base, makes sense to start with model alerting and then move to region
2024-05-24 19:10:33 -07:00
Ishaan Jaff
f6d7d0e520 fix - update webhook event validation 2024-05-24 18:31:10 -07:00
Ishaan Jaff
db54599222 feat - send email on api key created 2024-05-24 18:14:45 -07:00
Ishaan Jaff
41879ae002 feat - email alerts on /key/generate 2024-05-24 17:33:15 -07:00
Ishaan Jaff
fa49833cdf feat - send email on new key created 2024-05-24 17:30:06 -07:00
Krrish Dholakia
f8350b9461 fix(slack_alerting.py): support region based outage alerting 2024-05-24 16:59:16 -07:00
Ishaan Jaff
36916852ad fix - user_id_information 2024-05-24 15:37:36 -07:00
Ishaan Jaff
4f4607a4df fix - send email alert when token crossed it's budget 2024-05-24 15:22:28 -07:00
Ishaan Jaff
c3dd0a1470 feat check SMTP_SENDER_EMAIL + put behind enterprise license 2024-05-24 14:37:53 -07:00
Ishaan Jaff
268d4f7665 fix send_email update 2024-05-24 14:23:27 -07:00
Krrish Dholakia
2af4948af4 feat(proxy_server.py): add setup instructions for UI 2024-05-24 08:15:44 -07:00
Krrish Dholakia
bc399029d8 build(ui): updating admin ui with changes 2024-05-23 20:34:48 -07:00
Krrish Dholakia
f04e4b921b feat(ui/model_dashboard.tsx): add databricks models via admin ui 2024-05-23 20:28:54 -07:00
Ishaan Jaff
30ac77532b feat - enfore enterprise checks 2024-05-23 16:11:30 -07:00
Ishaan Jaff
580a342fdf
Merge pull request #3803 from BerriAI/litellm_add_lakera_ai
[Feat] Add Lakera AI Prompt Injection Detection
2024-05-23 16:01:24 -07:00
Ishaan Jaff
769070b3fe
Merge pull request #3797 from BerriAI/litellm_fix_post_call_streaming_hooks
[Fix]  async_post_call_streaming_hook not triggered on proxy server
2024-05-23 15:35:47 -07:00
Ishaan Jaff
ec41226ee8 feat - add lakera ai prompt injection detection 2024-05-23 15:16:46 -07:00
Ishaan Jaff
ea76432844
Merge pull request #3802 from BerriAI/litellm_add_moderations_safety_check_proxy
[Feat]- Proxy Add OpenAI Content Moderation Pre call hook
2024-05-23 14:30:49 -07:00
Ishaan Jaff
1fe035c6dd feat - add open ai moderations check 2024-05-23 13:08:06 -07:00
Krrish Dholakia
54591e3920 test(test_key_generate_prisma.py): fix tests with unique team id 2024-05-23 12:55:26 -07:00
Ishaan Jaff
5291f380c9 feat - async_post_call_streaming_hook 2024-05-23 09:30:53 -07:00
Krrish Dholakia
a33b9221da fix(proxy_server.py): fix admin allowed routes 2024-05-22 20:55:21 -07:00
Ishaan Jaff
9ba7d26597 feat - raise exception when creating team_id for exiting team_id 2024-05-22 20:26:56 -07:00
Krrish Dholakia
851f26b759 fix(slack_alerting.py): fix time check + add more debug values 2024-05-22 20:11:36 -07:00
Ishaan Jaff
288303f797 (ci/cd) run again - new ui build 2024-05-22 20:04:10 -07:00
Ishaan Jaff
a8b64a01dc
Merge pull request #3790 from BerriAI/litellm_set_team_member_budgets
[Feat] Set Budgets for Users within a Team
2024-05-22 19:44:04 -07:00
Krrish Dholakia
8d1067c81c fix(time_to_first_token.tsx): require enterprise license for usage 2024-05-22 19:20:30 -07:00
Ishaan Jaff
c535576af5 feat - update existing_team_member_spend in memory 2024-05-22 18:44:44 -07:00
Ishaan Jaff
7a8d78e6c9 feat - check team member spend, budget 2024-05-22 18:36:39 -07:00
Krish Dholakia
3400596dd2
Merge pull request #3787 from BerriAI/litellm_set_budgets_ui_2
feat(proxy_server.py): enable admin to set tpm/rpm limits for end-users via UI
2024-05-22 18:22:51 -07:00
Krish Dholakia
5a3aca10ce
Merge pull request #3789 from BerriAI/litellm_ttft_ui
feat(schema.prisma): store model id + model group as part of spend logs allows precise model metrics
2024-05-22 18:22:39 -07:00
Krrish Dholakia
259003ad8c fix(proxy_server.py): fix model metrics to return slow responses based on model group 2024-05-22 18:21:27 -07:00
Krrish Dholakia
0f2c55dd81 feat(ui/time_to_first_token.tsx): add time to first token card to model metrics 2024-05-22 18:09:53 -07:00
Ishaan Jaff
c2d25b9a14 feat - update team+user_id spend 2024-05-22 17:49:54 -07:00
Krrish Dholakia
3c0e9cb48e feat(schema.prisma): store model id + model group as part of spend logs
allows precise model metrics
2024-05-22 17:29:44 -07:00
Ishaan Jaff
f548334e8b build - prisma schema change 2024-05-22 17:19:31 -07:00
Ishaan Jaff
50461eb22c feat - create budgets when team/member_add 2024-05-22 17:16:19 -07:00
Ishaan Jaff
31fc6d79af
Merge pull request #3785 from BerriAI/litellm_end_user_rate_limits
[Feat] LiteLLM Proxy: Enforce End-User TPM, RPM Limits
2024-05-22 17:12:58 -07:00
Krrish Dholakia
9f518030b9 build(schema.prisma): add completion_start_time to spend logs
enables time to first token tracking for models
2024-05-22 16:43:08 -07:00
Krrish Dholakia
7f0d85f12a feat(proxy_server.py): enable admin to create/delete budgets for end-users 2024-05-22 16:20:06 -07:00
Ishaan Jaff
42078ac285 fix - run tpm / rpm checks on proxy admin keys too 2024-05-22 16:15:09 -07:00
Ishaan Jaff
e6b406d739 feat - enforce end user tpm / rpm limits 2024-05-22 15:45:30 -07:00
Ishaan Jaff
106910cecf feat - add end user rate limiting 2024-05-22 14:01:57 -07:00
Ishaan Jaff
eac7e70dca feat - include litellm_budget table when getting end_user 2024-05-22 13:30:08 -07:00
Krrish Dholakia
9609df16d3 feat(proxy_server.py): new 'add budget' endpoint
create a budget object (max budget, tpm /rpm limits, etc.) and assign that to a user/team/etc.
2024-05-22 13:16:37 -07:00
Ishaan Jaff
b81fcf2482 feat -add failure callbacks from DB to proxy 2024-05-21 22:00:36 -07:00
Ishaan Jaff
64130e368c feat - create keys with permissions 2024-05-21 18:04:17 -07:00
Ishaan Jaff
829ca84f07
Merge pull request #3750 from BerriAI/litellm_raise_exception_when_deleting_updating_team
[Fix] - raise Exception when trying to update/delete a non-existent team
2024-05-20 18:42:27 -07:00
Krish Dholakia
707cf24472
Merge branch 'main' into litellm_webhook_support 2024-05-20 18:41:58 -07:00