Commit graph

1196 commits

Author SHA1 Message Date
Krrish Dholakia
74a943bc62 fix(proxy_server.py): fix cache param arg name 2024-04-01 19:14:39 -07:00
Krrish Dholakia
aa5f88ec45 fix(proxy_server.py): allow user to set in-memory + redis ttl
addresses - https://github.com/BerriAI/litellm/issues/2700
2024-04-01 19:13:23 -07:00
Ishaan Jaff
716fcd3ec4 (fix) allow wildcard models 2024-04-01 19:07:05 -07:00
Krrish Dholakia
abf87fe448 fix(proxy_server.py): don't require scope for team-based jwt access
If a team with the client_id exists, it should be allowed to make a request; if it doesn't, then, as we discussed, it should return an error
2024-04-01 18:52:00 -07:00
Ishaan Jaff
96d7377824 Merge pull request #2785 from BerriAI/litellm_high_traffic_redis_caching_fixes
[Feat] Proxy - high traffic redis caching - when using `url`
2024-04-01 18:38:27 -07:00
Krrish Dholakia
e06d43dc90 fix(tpm_rpm_limiter.py): fix cache init logic 2024-04-01 18:01:38 -07:00
Ishaan Jaff
86ba03e08a add /cache/redis/info endpoint 2024-04-01 16:51:23 -07:00
Ishaan Jaff
aff63e556c (fix) _update_end_user_cache 2024-04-01 11:18:00 -07:00
Krrish Dholakia
f68658b2da test(test_max_tpm_rpm_limiter.py): add unit testing for redis namespaces working for tpm/rpm limits 2024-04-01 10:39:03 -07:00
Krish Dholakia
f635d15c2b Merge pull request #2775 from BerriAI/litellm_redis_user_api_key_cache_v3
fix(tpm_rpm_limiter.py): enable redis caching for tpm/rpm checks on keys/user/teams
2024-03-30 22:07:05 -07:00
Krrish Dholakia
555f0af027 fix(tpm_rpm_limiter.py): enable redis caching for tpm/rpm checks on keys/user/teams
allows tpm/rpm checks to work across instances

https://github.com/BerriAI/litellm/issues/2730
2024-03-30 20:01:36 -07:00
Ishaan Jaff
e1f3634855 (ui) show proxy spend 2024-03-30 14:02:43 -07:00
Ishaan Jaff
9e0e389109 (ui) view spend by team name on usage 2024-03-30 13:25:32 -07:00
Krrish Dholakia
67f6f8871d fix(proxy_server.py): support azure openai text completion calls 2024-03-30 11:30:06 -07:00
Krrish Dholakia
cda0c49b20 fix(proxy_server.py): fix /key/update endpoint to update key duration
also adds a test for this to our ci/cd
2024-03-29 21:47:10 -07:00
Krish Dholakia
6394a001a2 Merge pull request #2757 from BerriAI/litellm_fix_budget_alerts
fix(auth_checks.py): make global spend checks more accurate
2024-03-29 21:13:27 -07:00
Krrish Dholakia
5ad1251b4b docs(proxy_server.py): fix example on swagger for team member delete 2024-03-29 20:09:54 -07:00
Krrish Dholakia
9fc8b8a215 fix(proxy_server.py): increment cached global proxy spend object 2024-03-29 20:02:31 -07:00
Krrish Dholakia
49e2624240 fix(proxy_server.py): enforce end user budgets with 'litellm.max_end_user_budget' param 2024-03-29 17:14:40 -07:00
Krrish Dholakia
f7004a94df fix(proxy_server.py): fix max budget check to also fire slack alert 2024-03-29 16:24:40 -07:00
Krrish Dholakia
fbe4d6d332 fix(proxy_server.py): fix checks 2024-03-29 15:34:13 -07:00
Krrish Dholakia
2d5201aa4e fix(auth_checks.py): make global spend checks more accurate 2024-03-29 14:57:44 -07:00
Ishaan Jaff
f6cb76c110 (fix) show correct spend on ui 2024-03-29 09:41:00 -07:00
Krrish Dholakia
48df7003a0 fix(proxy_server.py): handle pydantic json versioning issue 2024-03-28 23:23:50 -07:00
Krrish Dholakia
dc9dc3b1f6 fix(proxy_server.py): fix pydantic versioning issue 2024-03-28 22:42:26 -07:00
Krrish Dholakia
737bb3e444 fix(proxy_server.py): fix tpm/rpm limiting for jwt auth
fixes tpm/rpm limiting for jwt auth and implements unit tests for jwt auth
2024-03-28 21:19:34 -07:00
Krrish Dholakia
6848e3b1d2 fix(proxy_server.py): enable spend tracking for team-based jwt auth 2024-03-28 20:16:22 -07:00
Krrish Dholakia
5634a8c7e6 fix(proxy_server.py): return deleted team id's to confirm /team/delete worked 2024-03-28 16:09:33 -07:00
Krrish Dholakia
9717aa1dad docs(proxy_server.py): add update team TPM/RPM limit to swagger docs 2024-03-28 16:02:51 -07:00
Krrish Dholakia
ec49ebb72b fix(proxy_server.py): allow updating tpm/rpm limits via /team/update 2024-03-28 15:59:35 -07:00
Krrish Dholakia
501d0a16f5 docs(proxy_server.py): improve swagger docs for /team/new 2024-03-28 15:50:38 -07:00
Krrish Dholakia
0e76e6161a docs(proxy_server.py): improve swagger doc for /team/new 2024-03-28 15:39:48 -07:00
Krrish Dholakia
be8dd499c3 fix(proxy_server.py): fix user_id_information none check 2024-03-28 15:18:52 -07:00
Ishaan Jaff
cab28e874a Merge pull request #2735 from BerriAI/admin_ui_clearly_show_models
Admin UI clearly show models
2024-03-28 15:14:38 -07:00
Ishaan Jaff
124fb924ca (fix) actually view the spend report on the UI 2024-03-28 15:08:21 -07:00
Krish Dholakia
ddac479ae7 Merge branch 'main' into litellm_auto_create_user_fix 2024-03-28 14:57:39 -07:00
Krish Dholakia
b828290c81 Merge pull request #2722 from BerriAI/litellm_db_perf_improvement
feat(proxy/utils.py): enable updating db in a separate server
2024-03-28 14:56:14 -07:00
Ishaan Jaff
0a38044f0c (feat) return team alias in keys 2024-03-28 10:16:07 -07:00
Ishaan Jaff
61f897b446 (feat) return team alias when hitting user/info 2024-03-28 09:41:15 -07:00
Krrish Dholakia
e87c5f5d6f fix(proxy_server.py): allow user to pass in spend logs collector url 2024-03-28 09:14:30 -07:00
Ishaan Jaff
3044f3693d (fix) raise exception when master key not set + trying to use ui 2024-03-27 22:25:53 -07:00
Ishaan Jaff
f3cf29264f (fix) show user their role when rejecting /team/new requests 2024-03-27 20:54:07 -07:00
Ishaan Jaff
a4655e32f6 Merge pull request #2723 from BerriAI/litellm_proxy_perf_imp
[FEAT] Improve Proxy Perf - access router model names in constant time
2024-03-27 20:48:31 -07:00
Krrish Dholakia
27d03f3ed9 fix(proxy_server.py): fix linting error 2024-03-27 19:54:25 -07:00
Krrish Dholakia
14bd7a8c37 fix(proxy_server.py): fix budget add logic to accurately log who created it 2024-03-27 19:43:15 -07:00
Krish Dholakia
265d32679e Merge pull request #2725 from BerriAI/litellm_calculate_spend
feat(proxy_server.py): new `/spend/calculate` endpoint
2024-03-27 19:39:07 -07:00
Krish Dholakia
bc2df42e52 Merge pull request #2726 from BerriAI/litellm_enforce_user_param
feat(auth_checks.py): enable admin to enforce 'user' param for all openai endpoints
2024-03-27 19:38:52 -07:00
Krrish Dholakia
0417ce6cbe feat(auth_checks.py): enable admin to enforce 'user' param for all openai endpoints 2024-03-27 17:36:27 -07:00
Krrish Dholakia
4da70d223e feat(proxy_server.py): new /spend/calculate endpoint
Allows user to calculate spend before making the call
2024-03-27 17:08:55 -07:00
Krrish Dholakia
3df60e856c fix(proxy_server.py): don't auto-create user when creating key 2024-03-27 16:48:57 -07:00