Ishaan Jaff
50461eb22c
feat - create budgets when team/member_add
2024-05-22 17:16:19 -07:00
Ishaan Jaff
42078ac285
fix - run tpm / rpm checks on proxy admin keys too
2024-05-22 16:15:09 -07:00
Ishaan Jaff
e6b406d739
feat - enforce end user tpm / rpm limits
2024-05-22 15:45:30 -07:00
Krrish Dholakia
9609df16d3
feat(proxy_server.py): new 'add budget' endpoint
...
create a budget object (max budget, tpm / rpm limits, etc.) and assign that to a user/team/etc.
2024-05-22 13:16:37 -07:00
Ishaan Jaff
b81fcf2482
feat - add failure callbacks from DB to proxy
2024-05-21 22:00:36 -07:00
Ishaan Jaff
64130e368c
feat - create keys with permissions
2024-05-21 18:04:17 -07:00
Ishaan Jaff
829ca84f07
Merge pull request #3750 from BerriAI/litellm_raise_exception_when_deleting_updating_team
...
[Fix] - raise Exception when trying to update/delete a non-existent team
2024-05-20 18:42:27 -07:00
Krish Dholakia
707cf24472
Merge branch 'main' into litellm_webhook_support
2024-05-20 18:41:58 -07:00
Ishaan Jaff
ad91bff6f7
Merge pull request #3749 from BerriAI/litellm_raise_404_when_team_not_exist
...
[Fix] - raise 404 from `/team/info` when team does not exist
2024-05-20 17:52:43 -07:00
Krish Dholakia
c6bb6e325b
Merge pull request #3740 from BerriAI/litellm_return_rejected_response
...
feat(proxy_server.py): allow admin to return rejected response as string to user
2024-05-20 17:48:21 -07:00
Krrish Dholakia
9d815be0b5
fix(proxy_server.py): fix error string
2024-05-20 17:37:41 -07:00
Ishaan Jaff
367d86047c
fix - raise Exception when trying to update/delete a non-existent team
2024-05-20 17:36:08 -07:00
Ishaan Jaff
f574127bc4
fix - raise 404 when team does not exist
2024-05-20 17:14:03 -07:00
Krrish Dholakia
867f9300e3
fix(slack_alerting.py): cleanup webhook event
2024-05-20 16:55:01 -07:00
Ishaan Jaff
498bfa9a4c
fix - revert check_request_disconnection
2024-05-20 15:43:29 -07:00
Krrish Dholakia
da0e5d1b8d
feat(slack_alerting.py): support webhook for budget alerts
...
Closes https://github.com/BerriAI/litellm/issues/3743
2024-05-20 15:30:56 -07:00
Ishaan Jaff
1deb274d3a
Merge pull request #3742 from BerriAI/litellm_enforce_sso
...
[Feat] Enforce user has a valid license when using SSO on LiteLLM Proxy
2024-05-20 14:07:23 -07:00
Ishaan Jaff
d956020470
fix error on enforce sso
2024-05-20 13:02:56 -07:00
Ishaan Jaff
561b00283c
feat - enforce sso on Admin UI
2024-05-20 12:54:08 -07:00
Ishaan Jaff
8e730f57b8
only run check_request_disconnection logic for 10 mins
2024-05-20 12:39:03 -07:00
Krrish Dholakia
b41f30ca60
fix(proxy_server.py): fixes for making rejected responses work with streaming
2024-05-20 12:32:19 -07:00
Ishaan Jaff
b60a098251
docs - update openapi swagger docs
2024-05-20 12:26:42 -07:00
Krrish Dholakia
f11f207ae6
feat(proxy_server.py): refactor returning rejected message, to work with error logging
...
log the rejected request as a failed call to langfuse/slack alerting
2024-05-20 11:14:36 -07:00
Krrish Dholakia
372323c38a
feat(proxy_server.py): allow admin to return rejected response as string to user
...
Closes https://github.com/BerriAI/litellm/issues/3671
2024-05-20 10:30:23 -07:00
Krrish Dholakia
25df95ab10
feat(proxy_server.py): new 'supported_openai_params' endpoint
...
get supported openai params for a given model
2024-05-20 08:39:50 -07:00
Krish Dholakia
5e5179e476
Merge branch 'main' into litellm_model_id_fix
2024-05-17 22:36:17 -07:00
Ishaan Jaff
8281c150f0
Merge pull request #3713 from BerriAI/litellm_ui_infer_azure_prefix
...
[Feat] Admin UI - use `base_model` for Slack Alerts
2024-05-17 21:55:23 -07:00
Krrish Dholakia
4b3551abfc
fix(slack_alerting.py): show langfuse traces on error messages
2024-05-17 18:42:30 -07:00
Krish Dholakia
3a06fe2818
Merge branch 'main' into litellm_bedrock_anthropic_fix
2024-05-17 17:47:32 -07:00
Krrish Dholakia
b137cea230
fix(proxy_server.py): fix setting model id for db models
...
get model_id and use that as its id in router; this enables `/model/delete` to work with the given id from `/model/info`
2024-05-17 17:45:05 -07:00
Ishaan Jaff
be273b3c3b
fix - show correct base_model in slack alerts
2024-05-17 16:07:02 -07:00
Krrish Dholakia
c0d62e94ae
feat(proxy_server.py): enable custom branding + routes on openapi docs
...
Allows user to add their branding + show only openai routes on docs
2024-05-17 15:21:29 -07:00
Krrish Dholakia
180bc46ca4
fix(bedrock_httpx.py): move anthropic bedrock calls to httpx
...
Fixing https://github.com/BerriAI/litellm/issues/2921
2024-05-16 21:51:55 -07:00
Krrish Dholakia
10a672634d
fix(proxy_server.py): fix invalid header string
2024-05-16 21:05:40 -07:00
Krish Dholakia
7502e15295
Merge pull request #3701 from paneru-rajan/Issue-3675-remove-empty-valued-header
...
Exclude custom headers from response if the value is None or empty string
2024-05-16 17:42:07 -07:00
Rajan Paneru
e4ce10038a
use default empty str if the allowed_model_region attribute is not present
2024-05-17 10:05:18 +09:30
Rajan Paneru
54f8d06057
handle exception and log it
2024-05-17 09:55:13 +09:30
Ishaan Jaff
a292583ff1
fix - allow users to opt into specific alert types
2024-05-16 16:52:44 -07:00
Rajan Paneru
85679470c2
Exclude custom headers from response if the value is None or empty string
...
This returns clean headers: sending a header with an empty value is non-standard,
and this fix avoids it.
2024-05-17 09:06:58 +09:30
Krrish Dholakia
48714805bd
fix(proxy_server.py): fix code
2024-05-16 15:02:39 -07:00
Krish Dholakia
0a775821db
Merge branch 'main' into litellm_end_user_obj
2024-05-16 14:16:09 -07:00
Ishaan Jaff
0a816b2c45
Merge pull request #3682 from BerriAI/litellm_token_counter_endpoint
...
[Feat] `token_counter` endpoint
2024-05-16 13:39:23 -07:00
Ishaan Jaff
4a5e6aa43c
test - token count response
2024-05-16 13:20:01 -07:00
Krish Dholakia
d43f75150a
Merge pull request #3685 from BerriAI/litellm_lago_integration
...
feat(lago.py): Enable Usage-based billing with lago
2024-05-16 13:09:48 -07:00
Ishaan Jaff
d16a6c03a2
feat - include model name in cool down alerts
2024-05-16 12:52:15 -07:00
Ishaan Jaff
8c3657bad0
Merge pull request #3686 from msabramo/msabramo/fix-datetime-utcnow-deprecation-warnings
...
Fix `datetime.datetime.utcnow` `DeprecationWarning`
2024-05-16 12:19:06 -07:00
Krish Dholakia
ea976d8c30
Merge pull request #3663 from msabramo/msabramo/allow-non-admins-to-use-openai-routes
...
Allow non-admins to use `/engines/{model}/chat/completions`
2024-05-16 12:17:50 -07:00
Marc Abramowitz
4af6638be6
Fix datetime.datetime.utcnow DeprecationWarning
...
Eliminates these warnings when running tests:
```
$ cd litellm/tests
$ pytest test_key_generate_prisma.py -x -vv
...
====================================================================== warnings summary =======================================================================
...
test_key_generate_prisma.py::test_generate_and_call_with_expired_key
test_key_generate_prisma.py::test_key_with_no_permissions
/Users/abramowi/Code/OpenSource/litellm/litellm/proxy/proxy_server.py:2934: DeprecationWarning: datetime.datetime.utcnow() is deprecated and scheduled for removal in a future version. Use timezone-aware objects to represent datetimes in UTC: datetime.datetime.now(datetime.UTC).
expires = datetime.utcnow() + timedelta(seconds=duration_s)
...
```
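A minimal sketch of the kind of replacement the warning text suggests, assuming the expiry is computed from a duration in seconds as in the flagged line. `compute_expiry` is a hypothetical helper name used only for illustration; it is not taken from `proxy_server.py`, and the actual change in this commit may differ.

```python
from datetime import datetime, timedelta, timezone


def compute_expiry(duration_s: int) -> datetime:
    """Return a timezone-aware UTC expiry `duration_s` seconds from now.

    Hypothetical helper: stands in for the flagged expression
    `datetime.utcnow() + timedelta(seconds=duration_s)`.
    """
    # datetime.now(timezone.utc) returns an aware datetime and works on
    # Python versions that predate the datetime.UTC alias.
    return datetime.now(timezone.utc) + timedelta(seconds=duration_s)


expires = compute_expiry(60)
```

Note that the result is timezone-aware, so comparing it against a naive datetime (such as one produced by `datetime.utcnow()`) raises `TypeError`; call sites that compare expiries may need to be updated together.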
2024-05-16 11:56:02 -07:00
Ishaan Jaff
22ba5fa186
feat - try using hf tokenizer
2024-05-16 10:59:29 -07:00
Krrish Dholakia
e273e66618
feat(lago.py): adding support for usage-based billing with lago
...
Closes https://github.com/BerriAI/litellm/issues/3639
2024-05-16 10:54:18 -07:00