Commit graph

1384 commits

Author SHA1 Message Date
Krrish Dholakia
8dec87425e feat(slack_alerting.py): refactor region outage alerting to do model based alerting instead
Unable to extract azure region from api base, makes sense to start with model alerting and then move to region
2024-05-24 19:10:33 -07:00
Krrish Dholakia
f04e4b921b feat(ui/model_dashboard.tsx): add databricks models via admin ui 2024-05-23 20:28:54 -07:00
Ishaan Jaff
30ac77532b feat - enfore enterprise checks 2024-05-23 16:11:30 -07:00
Ishaan Jaff
580a342fdf
Merge pull request #3803 from BerriAI/litellm_add_lakera_ai
[Feat] Add Lakera AI Prompt Injection Detection
2024-05-23 16:01:24 -07:00
Ishaan Jaff
769070b3fe
Merge pull request #3797 from BerriAI/litellm_fix_post_call_streaming_hooks
[Fix]  async_post_call_streaming_hook not triggered on proxy server
2024-05-23 15:35:47 -07:00
Ishaan Jaff
ec41226ee8 feat - add lakera ai prompt injection detection 2024-05-23 15:16:46 -07:00
Ishaan Jaff
1fe035c6dd feat - add open ai moderations check 2024-05-23 13:08:06 -07:00
Ishaan Jaff
5291f380c9 feat - async_post_call_streaming_hook 2024-05-23 09:30:53 -07:00
Krrish Dholakia
a33b9221da fix(proxy_server.py): fix admin allowed routes 2024-05-22 20:55:21 -07:00
Ishaan Jaff
9ba7d26597 feat - raise exception when creating team_id for exiting team_id 2024-05-22 20:26:56 -07:00
Ishaan Jaff
a8b64a01dc
Merge pull request #3790 from BerriAI/litellm_set_team_member_budgets
[Feat] Set Budgets for Users within a Team
2024-05-22 19:44:04 -07:00
Krrish Dholakia
8d1067c81c fix(time_to_first_token.tsx): require enterprise license for usage 2024-05-22 19:20:30 -07:00
Ishaan Jaff
c535576af5 feat - update existing_team_member_spend in memory 2024-05-22 18:44:44 -07:00
Ishaan Jaff
7a8d78e6c9 feat - check team member spend, budget 2024-05-22 18:36:39 -07:00
Krish Dholakia
3400596dd2
Merge pull request #3787 from BerriAI/litellm_set_budgets_ui_2
feat(proxy_server.py): enable admin to set tpm/rpm limits for end-users via UI
2024-05-22 18:22:51 -07:00
Krish Dholakia
5a3aca10ce
Merge pull request #3789 from BerriAI/litellm_ttft_ui
feat(schema.prisma): store model id + model group as part of spend logs allows precise model metrics
2024-05-22 18:22:39 -07:00
Krrish Dholakia
259003ad8c fix(proxy_server.py): fix model metrics to return slow responses based on model group 2024-05-22 18:21:27 -07:00
Krrish Dholakia
0f2c55dd81 feat(ui/time_to_first_token.tsx): add time to first token card to model metrics 2024-05-22 18:09:53 -07:00
Ishaan Jaff
c2d25b9a14 feat - update team+user_id spend 2024-05-22 17:49:54 -07:00
Ishaan Jaff
50461eb22c feat - create budgets when team/member_add 2024-05-22 17:16:19 -07:00
Krrish Dholakia
7f0d85f12a feat(proxy_server.py): enable admin to create/delete budgets for end-users 2024-05-22 16:20:06 -07:00
Ishaan Jaff
42078ac285 fix - run tpm / rpm checks on proxy admin keys too 2024-05-22 16:15:09 -07:00
Ishaan Jaff
e6b406d739 feat - enforce end user tpm / rpm limits 2024-05-22 15:45:30 -07:00
Krrish Dholakia
9609df16d3 feat(proxy_server.py): new 'add budget' endpoint
create a budget object (max budget, tpm /rpm limits, etc.) and assign that to a user/team/etc.
2024-05-22 13:16:37 -07:00
Ishaan Jaff
b81fcf2482 feat -add failure callbacks from DB to proxy 2024-05-21 22:00:36 -07:00
Ishaan Jaff
64130e368c feat - create keys with permissions 2024-05-21 18:04:17 -07:00
Ishaan Jaff
829ca84f07
Merge pull request #3750 from BerriAI/litellm_raise_exception_when_deleting_updating_team
[Fix] - raise Exception when trying to update/delete a non-existent team
2024-05-20 18:42:27 -07:00
Krish Dholakia
707cf24472
Merge branch 'main' into litellm_webhook_support 2024-05-20 18:41:58 -07:00
Ishaan Jaff
ad91bff6f7
Merge pull request #3749 from BerriAI/litellm_raise_404_when_team_not_exist
[Fix] - raise 404 from `/team/info`  when team does not exist
2024-05-20 17:52:43 -07:00
Krish Dholakia
c6bb6e325b
Merge pull request #3740 from BerriAI/litellm_return_rejected_response
feat(proxy_server.py): allow admin to return rejected response as string to user
2024-05-20 17:48:21 -07:00
Krrish Dholakia
9d815be0b5 fix(proxy_server.py): fix error string 2024-05-20 17:37:41 -07:00
Ishaan Jaff
367d86047c fix - raise Exception when trying to update/delete a non-existtent team 2024-05-20 17:36:08 -07:00
Ishaan Jaff
f574127bc4 fix - raise 404 when team does not exist 2024-05-20 17:14:03 -07:00
Krrish Dholakia
867f9300e3 fix(slack_alerting.py): cleanup webhook event 2024-05-20 16:55:01 -07:00
Ishaan Jaff
498bfa9a4c fix - revert check_request_disconnection 2024-05-20 15:43:29 -07:00
Krrish Dholakia
da0e5d1b8d feat(slack_alerting.py): support webhook for budget alerts
Closes https://github.com/BerriAI/litellm/issues/3743
2024-05-20 15:30:56 -07:00
Ishaan Jaff
1deb274d3a
Merge pull request #3742 from BerriAI/litellm_enforce_sso
[Feat] Enforce user has a valid license when using SSO on LiteLLM Proxy
2024-05-20 14:07:23 -07:00
Ishaan Jaff
d956020470 fix error on enforce sso 2024-05-20 13:02:56 -07:00
Ishaan Jaff
561b00283c feat - enforce sso on Admin UI 2024-05-20 12:54:08 -07:00
Ishaan Jaff
8e730f57b8 only run check_request_disconnection logic for 10 mins 2024-05-20 12:39:03 -07:00
Krrish Dholakia
b41f30ca60 fix(proxy_server.py): fixes for making rejected responses work with streaming 2024-05-20 12:32:19 -07:00
Ishaan Jaff
b60a098251 docs - update openapi swagger docs 2024-05-20 12:26:42 -07:00
Krrish Dholakia
f11f207ae6 feat(proxy_server.py): refactor returning rejected message, to work with error logging
log the rejected request as a failed call to langfuse/slack alerting
2024-05-20 11:14:36 -07:00
Krrish Dholakia
372323c38a feat(proxy_server.py): allow admin to return rejected response as string to user
Closes https://github.com/BerriAI/litellm/issues/3671
2024-05-20 10:30:23 -07:00
Krrish Dholakia
25df95ab10 feat(proxy_server.py): new 'supported_openai_params' endpoint
get supported openai params for a given model
2024-05-20 08:39:50 -07:00
Krish Dholakia
5e5179e476
Merge branch 'main' into litellm_model_id_fix 2024-05-17 22:36:17 -07:00
Ishaan Jaff
8281c150f0
Merge pull request #3713 from BerriAI/litellm_ui_infer_azure_prefix
[Feat] Admin UI - use `base_model` for Slack Alerts
2024-05-17 21:55:23 -07:00
Krrish Dholakia
4b3551abfc fix(slack_alerting.py): show langfuse traces on error messages 2024-05-17 18:42:30 -07:00
Krish Dholakia
3a06fe2818
Merge branch 'main' into litellm_bedrock_anthropic_fix 2024-05-17 17:47:32 -07:00
Krrish Dholakia
b137cea230 fix(proxy_server.py): fix setting model id for db models
get model_id and use that as it's id in router, this enables `/model/delete` to work with the given id from `/model/info`
2024-05-17 17:45:05 -07:00