Ishaan Jaff
|
03ec8219a4
|
fix - lakera ai integration
|
2024-05-23 15:25:26 -07:00 |
|
Ishaan Jaff
|
ec41226ee8
|
feat - add lakera ai prompt injection detection
|
2024-05-23 15:16:46 -07:00 |
|
Ishaan Jaff
|
35146d8628
|
docs - content mod
|
2024-05-23 14:33:52 -07:00 |
|
Ishaan Jaff
|
ea76432844
|
Merge pull request #3802 from BerriAI/litellm_add_moderations_safety_check_proxy
[Feat]- Proxy Add OpenAI Content Moderation Pre call hook
|
2024-05-23 14:30:49 -07:00 |
|
Ishaan Jaff
|
6d267fc7ad
|
test - openai moderation response
|
2024-05-23 13:24:46 -07:00 |
|
Ishaan Jaff
|
6aad4e38d3
|
feat - add debugging for moderation response
|
2024-05-23 13:24:09 -07:00 |
|
Ishaan Jaff
|
1fe035c6dd
|
feat - add open ai moderations check
|
2024-05-23 13:08:06 -07:00 |
|
Krrish Dholakia
|
54591e3920
|
test(test_key_generate_prisma.py): fix tests with unique team id
|
2024-05-23 12:55:26 -07:00 |
|
Krrish Dholakia
|
84f3690453
|
test(test_key_generate_prisma.py): fix tests with unique team id
|
2024-05-23 08:46:44 -07:00 |
|
Krrish Dholakia
|
5b1f928341
|
docs(users.md): add ui infographic to users.md
|
2024-05-22 21:11:00 -07:00 |
|
Krrish Dholakia
|
bcbd4feef4
|
bump: version 1.38.0 → 1.38.1
|
2024-05-22 20:56:47 -07:00 |
|
Krrish Dholakia
|
a33b9221da
|
fix(proxy_server.py): fix admin allowed routes
|
2024-05-22 20:55:21 -07:00 |
|
Ishaan Jaff
|
595a2a5b1b
|
fix test_key_model_list
|
2024-05-22 20:49:19 -07:00 |
|
Ishaan Jaff
|
405d5aa726
|
Merge pull request #3791 from BerriAI/litellm_fix_dont_allow_creating_existing_team_ids
[Fix] raise exception when creating team_id for existing team_id on `/team/new`
|
2024-05-22 20:35:34 -07:00 |
|
Ishaan Jaff
|
9ba7d26597
|
feat - raise exception when creating team_id for exiting team_id
|
2024-05-22 20:26:56 -07:00 |
|
Ishaan Jaff
|
b6b86dc539
|
ci/cd run again
|
2024-05-22 20:19:05 -07:00 |
|
Krrish Dholakia
|
851f26b759
|
fix(slack_alerting.py): fix time check + add more debug values
|
2024-05-22 20:11:36 -07:00 |
|
Ishaan Jaff
|
288303f797
|
(ci/cd) run again - new ui build
|
2024-05-22 20:04:10 -07:00 |
|
Ishaan Jaff
|
97f8cb63f2
|
(ci/cd) run again
|
2024-05-22 20:00:41 -07:00 |
|
Ishaan Jaff
|
41093bae2c
|
bump: version 1.37.20 → 1.38.0
|
2024-05-22 19:46:24 -07:00 |
|
Ishaan Jaff
|
a8b64a01dc
|
Merge pull request #3790 from BerriAI/litellm_set_team_member_budgets
[Feat] Set Budgets for Users within a Team
|
2024-05-22 19:44:04 -07:00 |
|
Ishaan Jaff
|
095b55a02e
|
docs - set budgets for members in a team
|
2024-05-22 19:43:48 -07:00 |
|
Krrish Dholakia
|
8d1067c81c
|
fix(time_to_first_token.tsx): require enterprise license for usage
|
2024-05-22 19:20:30 -07:00 |
|
Ishaan Jaff
|
20aa84e110
|
test - team member budgets
|
2024-05-22 19:19:51 -07:00 |
|
Ishaan Jaff
|
c535576af5
|
feat - update existing_team_member_spend in memory
|
2024-05-22 18:44:44 -07:00 |
|
Ishaan Jaff
|
7a8d78e6c9
|
feat - check team member spend, budget
|
2024-05-22 18:36:39 -07:00 |
|
Krish Dholakia
|
3400596dd2
|
Merge pull request #3787 from BerriAI/litellm_set_budgets_ui_2
feat(proxy_server.py): enable admin to set tpm/rpm limits for end-users via UI
|
2024-05-22 18:22:51 -07:00 |
|
Krish Dholakia
|
5a3aca10ce
|
Merge pull request #3789 from BerriAI/litellm_ttft_ui
feat(schema.prisma): store model id + model group as part of spend logs allows precise model metrics
|
2024-05-22 18:22:39 -07:00 |
|
Krrish Dholakia
|
259003ad8c
|
fix(proxy_server.py): fix model metrics to return slow responses based on model group
|
2024-05-22 18:21:27 -07:00 |
|
Krrish Dholakia
|
0f2c55dd81
|
feat(ui/time_to_first_token.tsx): add time to first token card to model metrics
|
2024-05-22 18:09:53 -07:00 |
|
Ishaan Jaff
|
c2d25b9a14
|
feat - update team+user_id spend
|
2024-05-22 17:49:54 -07:00 |
|
Krrish Dholakia
|
3c0e9cb48e
|
feat(schema.prisma): store model id + model group as part of spend logs
allows precise model metrics
|
2024-05-22 17:29:44 -07:00 |
|
Ishaan Jaff
|
f548334e8b
|
build - prisma schema change
|
2024-05-22 17:19:31 -07:00 |
|
Ishaan Jaff
|
50461eb22c
|
feat - create budgets when team/member_add
|
2024-05-22 17:16:19 -07:00 |
|
Ishaan Jaff
|
31fc6d79af
|
Merge pull request #3785 from BerriAI/litellm_end_user_rate_limits
[Feat] LiteLLM Proxy: Enforce End-User TPM, RPM Limits
|
2024-05-22 17:12:58 -07:00 |
|
Ishaan Jaff
|
a4cf453ad1
|
fix - end user rate limiting tests
|
2024-05-22 16:46:19 -07:00 |
|
Krrish Dholakia
|
9f518030b9
|
build(schema.prisma): add completion_start_time to spend logs
enables time to first token tracking for models
|
2024-05-22 16:43:08 -07:00 |
|
Ishaan Jaff
|
4175d00a24
|
fix - test end user rate limits with master key
|
2024-05-22 16:23:15 -07:00 |
|
Ishaan Jaff
|
a848a676af
|
docs - end user rate limiting
|
2024-05-22 16:20:25 -07:00 |
|
Krrish Dholakia
|
7f0d85f12a
|
feat(proxy_server.py): enable admin to create/delete budgets for end-users
|
2024-05-22 16:20:06 -07:00 |
|
Ishaan Jaff
|
42078ac285
|
fix - run tpm / rpm checks on proxy admin keys too
|
2024-05-22 16:15:09 -07:00 |
|
Ishaan Jaff
|
e6b406d739
|
feat - enforce end user tpm / rpm limits
|
2024-05-22 15:45:30 -07:00 |
|
Ishaan Jaff
|
bef10f4b01
|
test - end user tpm / rpm limiting
|
2024-05-22 15:42:41 -07:00 |
|
Ishaan Jaff
|
106910cecf
|
feat - add end user rate limiting
|
2024-05-22 14:01:57 -07:00 |
|
Ishaan Jaff
|
eac7e70dca
|
feat - include litellm_budget table when getting end_user
|
2024-05-22 13:30:08 -07:00 |
|
Krrish Dholakia
|
9609df16d3
|
feat(proxy_server.py): new 'add budget' endpoint
create a budget object (max budget, tpm /rpm limits, etc.) and assign that to a user/team/etc.
|
2024-05-22 13:16:37 -07:00 |
|
Ishaan Jaff
|
778b41cea5
|
fix - update migration script to ensure api_key in script
|
2024-05-22 12:22:23 -07:00 |
|
Ishaan Jaff
|
0623eb0274
|
Merge pull request #3783 from BerriAI/litellm_fix_dont_use_gunicorn_helm
[Fix] - don't use `gunicorn` on litellm helm
|
2024-05-22 07:50:47 -07:00 |
|
Ishaan Jaff
|
815dcd9187
|
fix - don't use gunicorn on litellm helm
|
2024-05-22 07:48:34 -07:00 |
|
Ishaan Jaff
|
4e5be51373
|
Merge pull request #3775 from BerriAI/litellm_add_failure_Callbacks_from_db
[Feat] add failure callbacks from DB to proxy
|
2024-05-22 07:36:04 -07:00 |
|