Commit graph

2013 commits

Author SHA1 Message Date
Krrish Dholakia
c096ba566f fix(proxy_server.py): fix cache param arg name 2024-04-01 19:14:39 -07:00
Krrish Dholakia
203e2776f8 fix(proxy_server.py): allow user to set in-memory + redis ttl
addresses - https://github.com/BerriAI/litellm/issues/2700
2024-04-01 19:13:23 -07:00
Ishaan Jaff
037b624c89 (fix) allow wildcard models 2024-04-01 19:07:05 -07:00
Krrish Dholakia
c52819d47c fix(proxy_server.py): don't require scope for team-based jwt access
If team with the client_id exists then it should be allowed to make a request, if it doesn't then as we discussed it should return an error
2024-04-01 18:52:00 -07:00
Ishaan Jaff
b14b6083f5 Merge pull request #2785 from BerriAI/litellm_high_traffic_redis_caching_fixes
[Feat] Proxy - high traffic redis caching - when using `url`
2024-04-01 18:38:27 -07:00
Krrish Dholakia
6467dd4e11 fix(tpm_rpm_limiter.py): fix cache init logic 2024-04-01 18:01:38 -07:00
Ishaan Jaff
9accc544e9 add /cache/redis/info endpoint 2024-04-01 16:51:23 -07:00
Krrish Dholakia
9c0aecf9b8 fix(proxy/utils.py): support redis caching for alerting 2024-04-01 16:13:59 -07:00
Krrish Dholakia
cdae08f3c3 docs(openai.md): fix docs to include example of calling openai on proxy 2024-04-01 12:09:22 -07:00
Ishaan Jaff
d5d800e141 (fix) _update_end_user_cache 2024-04-01 11:18:00 -07:00
Krrish Dholakia
c9e6b05cfb test(test_max_tpm_rpm_limiter.py): add unit testing for redis namespaces working for tpm/rpm limits 2024-04-01 10:39:03 -07:00
Krrish Dholakia
383f12bbd3 test(test_max_tpm_rpm_limiter.py): unit tests for key + team based tpm rpm limits on proxy 2024-04-01 08:00:01 -07:00
Krish Dholakia
1356f6cd32 Merge pull request #2775 from BerriAI/litellm_redis_user_api_key_cache_v3
fix(tpm_rpm_limiter.py): enable redis caching for tpm/rpm checks on keys/user/teams
2024-03-30 22:07:05 -07:00
Krrish Dholakia
3b8e7241b4 fix(proxy/utils.py): uncomment max parallel request limit check 2024-03-30 20:51:59 -07:00
Krrish Dholakia
d9ff13b624 fix(utils.py): set redis_usage_cache to none by default 2024-03-30 20:10:56 -07:00
Krrish Dholakia
f58fefd589 fix(tpm_rpm_limiter.py): enable redis caching for tpm/rpm checks on keys/user/teams
allows tpm/rpm checks to work across instances

https://github.com/BerriAI/litellm/issues/2730
2024-03-30 20:01:36 -07:00
Ishaan Jaff
22a22197c7 (ui) new ui build 2024-03-30 14:38:19 -07:00
Ishaan Jaff
23a18d4be3 (ui) show proxy spend 2024-03-30 14:02:43 -07:00
Ishaan Jaff
8daca76566 (ui) view spend by team name on usage 2024-03-30 13:25:32 -07:00
Ishaan Jaff
77b6cd01df (ui) new build 2024-03-30 12:17:07 -07:00
Ishaan Jaff
58cc11a312 Merge branch 'main' into litellm_edit_teams 2024-03-30 12:15:53 -07:00
Ishaan Jaff
46c9b57a29 (new) ui build 2024-03-30 12:14:43 -07:00
Ishaan Jaff
e6a0942780 (new) ui build 2024-03-30 12:10:25 -07:00
Krrish Dholakia
0342cd3b6b fix(proxy_server.py): support azure openai text completion calls 2024-03-30 11:30:06 -07:00
Krrish Dholakia
af2eabba91 fix(proxy_server.py): fix /key/update endpoint to update key duration
also adds a test for this to our ci/cd
2024-03-29 21:47:10 -07:00
Krrish Dholakia
68a1fa36cf fix(handle_jwt.py): fix linting issue 2024-03-29 21:35:36 -07:00
Krish Dholakia
6d9887969f Merge pull request #2757 from BerriAI/litellm_fix_budget_alerts
fix(auth_checks.py): make global spend checks more accurate
2024-03-29 21:13:27 -07:00
Krish Dholakia
f44df7bed6 Merge pull request #2762 from jacobnguyenn/update-jwt-auth
fix: still verify kid on single pub key fetched
2024-03-29 21:08:22 -07:00
Krrish Dholakia
48ac36e70d docs(proxy_server.py): fix example on swagger for team member delete 2024-03-29 20:09:54 -07:00
phuongnd96
4bf4be3a0f fix: still verify kid on single pub key fetched 2024-03-30 10:09:38 +07:00
Krrish Dholakia
3810b050c1 fix(proxy_server.py): increment cached global proxy spend object 2024-03-29 20:02:31 -07:00
Krrish Dholakia
5280fc809f fix(proxy_server.py): enforce end user budgets with 'litellm.max_end_user_budget' param 2024-03-29 17:14:40 -07:00
Ishaan Jaff
062239feb5 (ui) new build 2024-03-29 16:38:18 -07:00
Krrish Dholakia
786116783f fix(proxy_server.py): fix max budget check to also fire slack alert 2024-03-29 16:24:40 -07:00
Ishaan Jaff
733d36e275 (ui new build) 2024-03-29 16:09:29 -07:00
Krrish Dholakia
be6481bb36 fix(proxy_server.py): fix checks 2024-03-29 15:34:13 -07:00
Ishaan Jaff
aa50963674 bump: version 1.34.13 → 1.34.14 2024-03-29 15:19:10 -07:00
Ishaan Jaff
0ac7512821 (feat) new ui build 2024-03-29 15:16:59 -07:00
Krrish Dholakia
d8c15a5677 fix(auth_checks.py): make global spend checks more accurate 2024-03-29 14:57:44 -07:00
Ishaan Jaff
24570bc075 (docs) grafana / prometheus 2024-03-29 14:25:45 -07:00
Ishaan Jaff
d3e72e1c3b (fix) create key flow 2024-03-29 10:08:35 -07:00
Ishaan Jaff
7df2d7cb33 (fix) show correct spend on ui 2024-03-29 09:41:00 -07:00
Krrish Dholakia
618cb2441a fix(proxy_server.py): handle pydantic json versioning issue 2024-03-28 23:23:50 -07:00
Krrish Dholakia
5e55df1eaf fix(proxy_server.py): fix pydantic versioning issue 2024-03-28 22:42:26 -07:00
Krish Dholakia
a1a2fb4223 Merge pull request #2741 from BerriAI/litellm_jwt_auth_fixes_3
fix(proxy_server.py): fix tpm/rpm limiting for jwt auth
2024-03-28 22:16:31 -07:00
Krrish Dholakia
5a117490ec fix(proxy_server.py): fix tpm/rpm limiting for jwt auth
fixes tpm/rpm limiting for jwt auth and implements unit tests for jwt auth
2024-03-28 21:19:34 -07:00
Ishaan Jaff
036d9c3cad Merge pull request #2740 from BerriAI/litellm_ui_fixes
(ui) new build
2024-03-28 21:03:15 -07:00
Ishaan Jaff
bd207fe228 (ui) new build 2024-03-28 21:00:46 -07:00
Ishaan Jaff
d33b099de0 Merge pull request #2739 from BerriAI/litellm_ui_fixes
(fix) ui - clean up username display
2024-03-28 20:39:58 -07:00
Ishaan Jaff
94efa2d8dc (new ui build) 2024-03-28 20:38:30 -07:00