Krrish Dholakia
|
9c6eb643f8
|
fix(proxy_server.py): fix model metrics to return slow responses based on model group
|
2024-05-22 18:21:27 -07:00 |
|
Krrish Dholakia
|
06606dc01b
|
feat(ui/time_to_first_token.tsx): add time to first token card to model metrics
|
2024-05-22 18:09:53 -07:00 |
|
Ishaan Jaff
|
5a013fb7be
|
feat - update team+user_id spend
|
2024-05-22 17:49:54 -07:00 |
|
Ishaan Jaff
|
e22cfa55c4
|
feat - create budgets when team/member_add
|
2024-05-22 17:16:19 -07:00 |
|
Krrish Dholakia
|
5c0cb8c326
|
feat(proxy_server.py): enable admin to create/delete budgets for end-users
|
2024-05-22 16:20:06 -07:00 |
|
Ishaan Jaff
|
b391755c43
|
fix - run tpm / rpm checks on proxy admin keys too
|
2024-05-22 16:15:09 -07:00 |
|
Ishaan Jaff
|
6aa41e1698
|
feat - enforce end user tpm / rpm limits
|
2024-05-22 15:45:30 -07:00 |
|
Krrish Dholakia
|
7da614f3b7
|
feat(proxy_server.py): new 'add budget' endpoint
create a budget object (max budget, tpm /rpm limits, etc.) and assign that to a user/team/etc.
|
2024-05-22 13:16:37 -07:00 |
|
Ishaan Jaff
|
77cd332115
|
feat -add failure callbacks from DB to proxy
|
2024-05-21 22:00:36 -07:00 |
|
Ishaan Jaff
|
087e68c690
|
feat - create keys with permissions
|
2024-05-21 18:04:17 -07:00 |
|
Ishaan Jaff
|
383c928d59
|
Merge pull request #3750 from BerriAI/litellm_raise_exception_when_deleting_updating_team
[Fix] - raise Exception when trying to update/delete a non-existent team
|
2024-05-20 18:42:27 -07:00 |
|
Krish Dholakia
|
f3a47e4b1a
|
Merge branch 'main' into litellm_webhook_support
|
2024-05-20 18:41:58 -07:00 |
|
Ishaan Jaff
|
f1dd893e83
|
Merge pull request #3749 from BerriAI/litellm_raise_404_when_team_not_exist
[Fix] - raise 404 from `/team/info` when team does not exist
|
2024-05-20 17:52:43 -07:00 |
|
Krish Dholakia
|
db77e41833
|
Merge pull request #3740 from BerriAI/litellm_return_rejected_response
feat(proxy_server.py): allow admin to return rejected response as string to user
|
2024-05-20 17:48:21 -07:00 |
|
Krrish Dholakia
|
c6b4ad247b
|
fix(proxy_server.py): fix error string
|
2024-05-20 17:37:41 -07:00 |
|
Ishaan Jaff
|
df89da8573
|
fix - raise Exception when trying to update/delete a non-existtent team
|
2024-05-20 17:36:08 -07:00 |
|
Ishaan Jaff
|
f94349297c
|
fix - raise 404 when team does not exist
|
2024-05-20 17:14:03 -07:00 |
|
Krrish Dholakia
|
fd13a830ab
|
fix(slack_alerting.py): cleanup webhook event
|
2024-05-20 16:55:01 -07:00 |
|
Ishaan Jaff
|
508bff13ab
|
fix - revert check_request_disconnection
|
2024-05-20 15:43:29 -07:00 |
|
Krrish Dholakia
|
c0d5e83570
|
feat(slack_alerting.py): support webhook for budget alerts
Closes https://github.com/BerriAI/litellm/issues/3743
|
2024-05-20 15:30:56 -07:00 |
|
Ishaan Jaff
|
4f3a90bb2e
|
Merge pull request #3742 from BerriAI/litellm_enforce_sso
[Feat] Enforce user has a valid license when using SSO on LiteLLM Proxy
|
2024-05-20 14:07:23 -07:00 |
|
Ishaan Jaff
|
9d2f60079d
|
fix error on enforce sso
|
2024-05-20 13:02:56 -07:00 |
|
Ishaan Jaff
|
77ac6d9b74
|
feat - enforce sso on Admin UI
|
2024-05-20 12:54:08 -07:00 |
|
Ishaan Jaff
|
44c998293d
|
only run check_request_disconnection logic for 10 mins
|
2024-05-20 12:39:03 -07:00 |
|
Krrish Dholakia
|
d4d4550bb6
|
fix(proxy_server.py): fixes for making rejected responses work with streaming
|
2024-05-20 12:32:19 -07:00 |
|
Ishaan Jaff
|
df4e6b6576
|
docs - update openapi swagger docs
|
2024-05-20 12:26:42 -07:00 |
|
Krrish Dholakia
|
8fb8d068fb
|
feat(proxy_server.py): refactor returning rejected message, to work with error logging
log the rejected request as a failed call to langfuse/slack alerting
|
2024-05-20 11:14:36 -07:00 |
|
Krrish Dholakia
|
45fedb83c6
|
feat(proxy_server.py): allow admin to return rejected response as string to user
Closes https://github.com/BerriAI/litellm/issues/3671
|
2024-05-20 10:30:23 -07:00 |
|
Krrish Dholakia
|
0016477d3b
|
feat(proxy_server.py): new 'supported_openai_params' endpoint
get supported openai params for a given model
|
2024-05-20 08:39:50 -07:00 |
|
Krish Dholakia
|
90e9bf7769
|
Merge branch 'main' into litellm_model_id_fix
|
2024-05-17 22:36:17 -07:00 |
|
Ishaan Jaff
|
455bae52ae
|
Merge pull request #3713 from BerriAI/litellm_ui_infer_azure_prefix
[Feat] Admin UI - use `base_model` for Slack Alerts
|
2024-05-17 21:55:23 -07:00 |
|
Krrish Dholakia
|
1ccc604953
|
fix(slack_alerting.py): show langfuse traces on error messages
|
2024-05-17 18:42:30 -07:00 |
|
Krish Dholakia
|
60615f46c4
|
Merge branch 'main' into litellm_bedrock_anthropic_fix
|
2024-05-17 17:47:32 -07:00 |
|
Krrish Dholakia
|
45b7106951
|
fix(proxy_server.py): fix setting model id for db models
get model_id and use that as it's id in router, this enables `/model/delete` to work with the given id from `/model/info`
|
2024-05-17 17:45:05 -07:00 |
|
Ishaan Jaff
|
f92dd0ec9b
|
fix - show correct base_model in slack alerts
|
2024-05-17 16:07:02 -07:00 |
|
Krrish Dholakia
|
9ab2389b7e
|
feat(proxy_server.py): enable custom branding + routes on openapi docs
Allows user to add their branding + show only openai routes on docs
|
2024-05-17 15:21:29 -07:00 |
|
Krrish Dholakia
|
118fc4ffac
|
fix(bedrock_httpx.py): move anthropic bedrock calls to httpx
Fixing https://github.com/BerriAI/litellm/issues/2921
|
2024-05-16 21:51:55 -07:00 |
|
Krrish Dholakia
|
b639e9598b
|
fix(proxy_server.py): fix invalid header string
|
2024-05-16 21:05:40 -07:00 |
|
Krish Dholakia
|
854d173423
|
Merge pull request #3701 from paneru-rajan/Issue-3675-remove-empty-valued-header
Exclude custom headers from response if the value is None or empty string
|
2024-05-16 17:42:07 -07:00 |
|
Rajan Paneru
|
98d8f1d100
|
use default empty str if the allowed_model_region attribute is not present
|
2024-05-17 10:05:18 +09:30 |
|
Rajan Paneru
|
29e5ec1112
|
handle exception and logged it
|
2024-05-17 09:55:13 +09:30 |
|
Ishaan Jaff
|
70194f7ba2
|
fix - allow users to opt into specific alert types
|
2024-05-16 16:52:44 -07:00 |
|
Rajan Paneru
|
fa38c592b1
|
Exclude custom headers from response if the value is None or empty string
This will return clean header, sending a header with empty value is not standard which
is being avoided from this fix.
|
2024-05-17 09:06:58 +09:30 |
|
Krrish Dholakia
|
b9c7abf574
|
fix(proxy_server.py): fix code
|
2024-05-16 15:02:39 -07:00 |
|
Krish Dholakia
|
ae89240714
|
Merge branch 'main' into litellm_end_user_obj
|
2024-05-16 14:16:09 -07:00 |
|
Ishaan Jaff
|
3a52073b48
|
Merge pull request #3682 from BerriAI/litellm_token_counter_endpoint
[Feat] `token_counter` endpoint
|
2024-05-16 13:39:23 -07:00 |
|
Ishaan Jaff
|
11794f5fc5
|
test - token count response
|
2024-05-16 13:20:01 -07:00 |
|
Krish Dholakia
|
4c8b620352
|
Merge pull request #3685 from BerriAI/litellm_lago_integration
feat(lago.py): Enable Usage-based billing with lago
|
2024-05-16 13:09:48 -07:00 |
|
Ishaan Jaff
|
5ba4f5b4f1
|
feat - include model name in cool down alerts
|
2024-05-16 12:52:15 -07:00 |
|
Ishaan Jaff
|
bccc38acad
|
Merge pull request #3686 from msabramo/msabramo/fix-datetime-utcnow-deprecation-warnings
Fix `datetime.datetime.utcnow` `DeprecationWarning`
|
2024-05-16 12:19:06 -07:00 |
|