Krrish Dholakia
|
cdae08f3c3
|
docs(openai.md): fix docs to include example of calling openai on proxy
|
2024-04-01 12:09:22 -07:00 |
|
Ishaan Jaff
|
d5d800e141
|
(fix) _update_end_user_cache
|
2024-04-01 11:18:00 -07:00 |
|
Krrish Dholakia
|
c9e6b05cfb
|
test(test_max_tpm_rpm_limiter.py): add unit testing for redis namespaces working for tpm/rpm limits
|
2024-04-01 10:39:03 -07:00 |
|
Krrish Dholakia
|
383f12bbd3
|
test(test_max_tpm_rpm_limiter.py): unit tests for key + team based tpm rpm limits on proxy
|
2024-04-01 08:00:01 -07:00 |
|
Krish Dholakia
|
1356f6cd32
|
Merge pull request #2775 from BerriAI/litellm_redis_user_api_key_cache_v3
fix(tpm_rpm_limiter.py): enable redis caching for tpm/rpm checks on keys/user/teams
|
2024-03-30 22:07:05 -07:00 |
|
Krrish Dholakia
|
3b8e7241b4
|
fix(proxy/utils.py): uncomment max parallel request limit check
|
2024-03-30 20:51:59 -07:00 |
|
Krrish Dholakia
|
d9ff13b624
|
fix(utils.py): set redis_usage_cache to none by default
|
2024-03-30 20:10:56 -07:00 |
|
Krrish Dholakia
|
f58fefd589
|
fix(tpm_rpm_limiter.py): enable redis caching for tpm/rpm checks on keys/user/teams
allows tpm/rpm checks to work across instances
https://github.com/BerriAI/litellm/issues/2730
|
2024-03-30 20:01:36 -07:00 |
|
Ishaan Jaff
|
22a22197c7
|
(ui) new ui build
|
2024-03-30 14:38:19 -07:00 |
|
Ishaan Jaff
|
23a18d4be3
|
(ui) show proxy spend
|
2024-03-30 14:02:43 -07:00 |
|
Ishaan Jaff
|
8daca76566
|
(ui) view spend by team name on usage
|
2024-03-30 13:25:32 -07:00 |
|
Ishaan Jaff
|
77b6cd01df
|
(ui) new build
|
2024-03-30 12:17:07 -07:00 |
|
Ishaan Jaff
|
58cc11a312
|
Merge branch 'main' into litellm_edit_teams
|
2024-03-30 12:15:53 -07:00 |
|
Ishaan Jaff
|
46c9b57a29
|
(new) ui build
|
2024-03-30 12:14:43 -07:00 |
|
Ishaan Jaff
|
e6a0942780
|
(new) ui build
|
2024-03-30 12:10:25 -07:00 |
|
Krrish Dholakia
|
0342cd3b6b
|
fix(proxy_server.py): support azure openai text completion calls
|
2024-03-30 11:30:06 -07:00 |
|
Krrish Dholakia
|
af2eabba91
|
fix(proxy_server.py): fix /key/update endpoint to update key duration
also adds a test for this to our ci/cd
|
2024-03-29 21:47:10 -07:00 |
|
Krrish Dholakia
|
68a1fa36cf
|
fix(handle_jwt.py): fix linting issue
|
2024-03-29 21:35:36 -07:00 |
|
Krish Dholakia
|
6d9887969f
|
Merge pull request #2757 from BerriAI/litellm_fix_budget_alerts
fix(auth_checks.py): make global spend checks more accurate
|
2024-03-29 21:13:27 -07:00 |
|
Krish Dholakia
|
f44df7bed6
|
Merge pull request #2762 from jacobnguyenn/update-jwt-auth
fix: still verify kid on single pub key fetched
|
2024-03-29 21:08:22 -07:00 |
|
Krrish Dholakia
|
48ac36e70d
|
docs(proxy_server.py): fix example on swagger for team member delete
|
2024-03-29 20:09:54 -07:00 |
|
phuongnd96
|
4bf4be3a0f
|
fix: still verify kid on single pub key fetched
|
2024-03-30 10:09:38 +07:00 |
|
Krrish Dholakia
|
3810b050c1
|
fix(proxy_server.py): increment cached global proxy spend object
|
2024-03-29 20:02:31 -07:00 |
|
Krrish Dholakia
|
5280fc809f
|
fix(proxy_server.py): enforce end user budgets with 'litellm.max_end_user_budget' param
|
2024-03-29 17:14:40 -07:00 |
|
Ishaan Jaff
|
062239feb5
|
(ui) new build
|
2024-03-29 16:38:18 -07:00 |
|
Krrish Dholakia
|
786116783f
|
fix(proxy_server.py): fix max budget check to also fire slack alert
|
2024-03-29 16:24:40 -07:00 |
|
Ishaan Jaff
|
733d36e275
|
(ui new build)
|
2024-03-29 16:09:29 -07:00 |
|
Krrish Dholakia
|
be6481bb36
|
fix(proxy_server.py): fix checks
|
2024-03-29 15:34:13 -07:00 |
|
Ishaan Jaff
|
aa50963674
|
bump: version 1.34.13 → 1.34.14
|
2024-03-29 15:19:10 -07:00 |
|
Ishaan Jaff
|
0ac7512821
|
(feat) new ui build
|
2024-03-29 15:16:59 -07:00 |
|
Krrish Dholakia
|
d8c15a5677
|
fix(auth_checks.py): make global spend checks more accurate
|
2024-03-29 14:57:44 -07:00 |
|
Ishaan Jaff
|
24570bc075
|
(docs) grafana / prometheus
|
2024-03-29 14:25:45 -07:00 |
|
Ishaan Jaff
|
d3e72e1c3b
|
(fix) create key flow
|
2024-03-29 10:08:35 -07:00 |
|
Ishaan Jaff
|
7df2d7cb33
|
(fix) show correct spend on ui
|
2024-03-29 09:41:00 -07:00 |
|
Krrish Dholakia
|
618cb2441a
|
fix(proxy_server.py): handle pydantic json versioning issue
|
2024-03-28 23:23:50 -07:00 |
|
Krrish Dholakia
|
5e55df1eaf
|
fix(proxy_server.py): fix pydantic versioning issue
|
2024-03-28 22:42:26 -07:00 |
|
Krish Dholakia
|
a1a2fb4223
|
Merge pull request #2741 from BerriAI/litellm_jwt_auth_fixes_3
fix(proxy_server.py): fix tpm/rpm limiting for jwt auth
|
2024-03-28 22:16:31 -07:00 |
|
Krrish Dholakia
|
5a117490ec
|
fix(proxy_server.py): fix tpm/rpm limiting for jwt auth
fixes tpm/rpm limiting for jwt auth and implements unit tests for jwt auth
|
2024-03-28 21:19:34 -07:00 |
|
Ishaan Jaff
|
036d9c3cad
|
Merge pull request #2740 from BerriAI/litellm_ui_fixes
(ui) new build
|
2024-03-28 21:03:15 -07:00 |
|
Ishaan Jaff
|
bd207fe228
|
(ui) new build
|
2024-03-28 21:00:46 -07:00 |
|
Ishaan Jaff
|
d33b099de0
|
Merge pull request #2739 from BerriAI/litellm_ui_fixes
(fix) ui - clean up username display
|
2024-03-28 20:39:58 -07:00 |
|
Ishaan Jaff
|
94efa2d8dc
|
(new ui build)
|
2024-03-28 20:38:30 -07:00 |
|
Krrish Dholakia
|
c15ba368e7
|
fix(proxy_server.py): enable spend tracking for team-based jwt auth
|
2024-03-28 20:16:22 -07:00 |
|
Ishaan Jaff
|
e03a59a0d8
|
(feat) new ui build
|
2024-03-28 19:57:25 -07:00 |
|
Ishaan Jaff
|
792999d756
|
Merge pull request #2738 from BerriAI/litellm_ui_cleanup
[Admin UI] Use consistent spacing, show mandatory fields
|
2024-03-28 19:08:56 -07:00 |
|
Ishaan Jaff
|
1a4583fa54
|
(new ui build)
|
2024-03-28 19:08:16 -07:00 |
|
Krrish Dholakia
|
7c44b32cc2
|
refactor(proxy/utils.py): add more debug logs
|
2024-03-28 18:44:35 -07:00 |
|
Krrish Dholakia
|
19e43c8d29
|
fix(proxy_server.py): return deleted team id's to confirm /team/delete worked
|
2024-03-28 16:09:33 -07:00 |
|
Krrish Dholakia
|
f80b500917
|
docs(proxy_server.py): add update team TPM/RPM limit to swagger docs
|
2024-03-28 16:02:51 -07:00 |
|
Krrish Dholakia
|
58bdbecbc6
|
fix(proxy_server.py): allow updating tpm/rpm limits via /team/update
|
2024-03-28 15:59:35 -07:00 |
|