Ishaan Jaff
|
c2b9799e42
|
Merge pull request #2788 from BerriAI/litellm_support_-_models
[Feat] Allow using model = * on proxy config.yaml
|
2024-04-01 19:46:50 -07:00 |
|
Krish Dholakia
|
6d3bee777e
|
Merge pull request #2787 from BerriAI/litellm_optional_team_jwt_claim
fix(proxy_server.py): don't require scope for team-based jwt access
|
2024-04-01 19:16:39 -07:00 |
|
Ishaan Jaff
|
716fcd3ec4
|
(fix) allow wildcard models
|
2024-04-01 19:07:05 -07:00 |
|
Krrish Dholakia
|
abf87fe448
|
fix(proxy_server.py): don't require scope for team-based jwt access
If team with the client_id exists then it should be allowed to make a request, if it doesn't then as we discussed it should return an error
|
2024-04-01 18:52:00 -07:00 |
|
Ishaan Jaff
|
96d7377824
|
Merge pull request #2785 from BerriAI/litellm_high_traffic_redis_caching_fixes
[Feat] Proxy - high traffic redis caching - when using `url`
|
2024-04-01 18:38:27 -07:00 |
|
Krrish Dholakia
|
e06d43dc90
|
fix(tpm_rpm_limiter.py): fix cache init logic
|
2024-04-01 18:01:38 -07:00 |
|
Ishaan Jaff
|
86ba03e08a
|
add /cache/redis/info endpoint
|
2024-04-01 16:51:23 -07:00 |
|
Ishaan Jaff
|
aff63e556c
|
(fix) _update_end_user_cache
|
2024-04-01 11:18:00 -07:00 |
|
Krrish Dholakia
|
f68658b2da
|
test(test_max_tpm_rpm_limiter.py): add unit testing for redis namespaces working for tpm/rpm limits
|
2024-04-01 10:39:03 -07:00 |
|
Krish Dholakia
|
f635d15c2b
|
Merge pull request #2775 from BerriAI/litellm_redis_user_api_key_cache_v3
fix(tpm_rpm_limiter.py): enable redis caching for tpm/rpm checks on keys/user/teams
|
2024-03-30 22:07:05 -07:00 |
|
Krrish Dholakia
|
555f0af027
|
fix(tpm_rpm_limiter.py): enable redis caching for tpm/rpm checks on keys/user/teams
allows tpm/rpm checks to work across instances
https://github.com/BerriAI/litellm/issues/2730
|
2024-03-30 20:01:36 -07:00 |
|
Ishaan Jaff
|
e1f3634855
|
(ui) show proxy spend
|
2024-03-30 14:02:43 -07:00 |
|
Ishaan Jaff
|
9e0e389109
|
(ui) view spend by team name on usage
|
2024-03-30 13:25:32 -07:00 |
|
Krrish Dholakia
|
67f6f8871d
|
fix(proxy_server.py): support azure openai text completion calls
|
2024-03-30 11:30:06 -07:00 |
|
Krrish Dholakia
|
cda0c49b20
|
fix(proxy_server.py): fix /key/update endpoint to update key duration
also adds a test for this to our ci/cd
|
2024-03-29 21:47:10 -07:00 |
|
Krish Dholakia
|
6394a001a2
|
Merge pull request #2757 from BerriAI/litellm_fix_budget_alerts
fix(auth_checks.py): make global spend checks more accurate
|
2024-03-29 21:13:27 -07:00 |
|
Krrish Dholakia
|
5ad1251b4b
|
docs(proxy_server.py): fix example on swagger for team member delete
|
2024-03-29 20:09:54 -07:00 |
|
Krrish Dholakia
|
9fc8b8a215
|
fix(proxy_server.py): increment cached global proxy spend object
|
2024-03-29 20:02:31 -07:00 |
|
Krrish Dholakia
|
49e2624240
|
fix(proxy_server.py): enforce end user budgets with 'litellm.max_end_user_budget' param
|
2024-03-29 17:14:40 -07:00 |
|
Krrish Dholakia
|
f7004a94df
|
fix(proxy_server.py): fix max budget check to also fire slack alert
|
2024-03-29 16:24:40 -07:00 |
|
Krrish Dholakia
|
fbe4d6d332
|
fix(proxy_server.py): fix checks
|
2024-03-29 15:34:13 -07:00 |
|
Krrish Dholakia
|
2d5201aa4e
|
fix(auth_checks.py): make global spend checks more accurate
|
2024-03-29 14:57:44 -07:00 |
|
Ishaan Jaff
|
f6cb76c110
|
(fix) show correct spend on ui
|
2024-03-29 09:41:00 -07:00 |
|
Krrish Dholakia
|
48df7003a0
|
fix(proxy_server.py): handle pydantic json versioning issue
|
2024-03-28 23:23:50 -07:00 |
|
Krrish Dholakia
|
dc9dc3b1f6
|
fix(proxy_server.py): fix pydantic versioning issue
|
2024-03-28 22:42:26 -07:00 |
|
Krrish Dholakia
|
737bb3e444
|
fix(proxy_server.py): fix tpm/rpm limiting for jwt auth
fixes tpm/rpm limiting for jwt auth and implements unit tests for jwt auth
|
2024-03-28 21:19:34 -07:00 |
|
Krrish Dholakia
|
6848e3b1d2
|
fix(proxy_server.py): enable spend tracking for team-based jwt auth
|
2024-03-28 20:16:22 -07:00 |
|
Krrish Dholakia
|
5634a8c7e6
|
fix(proxy_server.py): return deleted team id's to confirm /team/delete worked
|
2024-03-28 16:09:33 -07:00 |
|
Krrish Dholakia
|
9717aa1dad
|
docs(proxy_server.py): add update team TPM/RPM limit to swagger docs
|
2024-03-28 16:02:51 -07:00 |
|
Krrish Dholakia
|
ec49ebb72b
|
fix(proxy_server.py): allow updating tpm/rpm limits via /team/update
|
2024-03-28 15:59:35 -07:00 |
|
Krrish Dholakia
|
501d0a16f5
|
docs(proxy_server.py): improve swagger docs for /team/new
|
2024-03-28 15:50:38 -07:00 |
|
Krrish Dholakia
|
0e76e6161a
|
docs(proxy_server.py): improve swagger doc for /team/new
|
2024-03-28 15:39:48 -07:00 |
|
Krrish Dholakia
|
be8dd499c3
|
fix(proxy_server.py): fix user_id_information none check
|
2024-03-28 15:18:52 -07:00 |
|
Ishaan Jaff
|
cab28e874a
|
Merge pull request #2735 from BerriAI/admin_ui_clearly_show_models
Admin UI clearly show models
|
2024-03-28 15:14:38 -07:00 |
|
Ishaan Jaff
|
124fb924ca
|
(fix) actually view the spend report on the UI
|
2024-03-28 15:08:21 -07:00 |
|
Krish Dholakia
|
ddac479ae7
|
Merge branch 'main' into litellm_auto_create_user_fix
|
2024-03-28 14:57:39 -07:00 |
|
Krish Dholakia
|
b828290c81
|
Merge pull request #2722 from BerriAI/litellm_db_perf_improvement
feat(proxy/utils.py): enable updating db in a separate server
|
2024-03-28 14:56:14 -07:00 |
|
Ishaan Jaff
|
0a38044f0c
|
(feat) return team alias in keys
|
2024-03-28 10:16:07 -07:00 |
|
Ishaan Jaff
|
61f897b446
|
(feat) return team alias when hitting user/info
|
2024-03-28 09:41:15 -07:00 |
|
Krrish Dholakia
|
e87c5f5d6f
|
fix(proxy_server.py): allow user to pass in spend logs collector url
|
2024-03-28 09:14:30 -07:00 |
|
Ishaan Jaff
|
3044f3693d
|
(fix) raise exception when master key not set + trying to use ui
|
2024-03-27 22:25:53 -07:00 |
|
Ishaan Jaff
|
f3cf29264f
|
(fix) show user their role when rejecting /team/new requests
|
2024-03-27 20:54:07 -07:00 |
|
Ishaan Jaff
|
a4655e32f6
|
Merge pull request #2723 from BerriAI/litellm_proxy_perf_imp
[FEAT] Improve Proxy Perf - access router model names in constant time
|
2024-03-27 20:48:31 -07:00 |
|
Krrish Dholakia
|
27d03f3ed9
|
fix(proxy_server.py): fix linting error
|
2024-03-27 19:54:25 -07:00 |
|
Krrish Dholakia
|
14bd7a8c37
|
fix(proxy_server.py): fix budget add logic to accurately log who created it
|
2024-03-27 19:43:15 -07:00 |
|
Krish Dholakia
|
265d32679e
|
Merge pull request #2725 from BerriAI/litellm_calculate_spend
feat(proxy_server.py): new `/spend/calculate` endpoint
|
2024-03-27 19:39:07 -07:00 |
|
Krish Dholakia
|
bc2df42e52
|
Merge pull request #2726 from BerriAI/litellm_enforce_user_param
feat(auth_checks.py): enable admin to enforce 'user' param for all openai endpoints
|
2024-03-27 19:38:52 -07:00 |
|
Krrish Dholakia
|
0417ce6cbe
|
feat(auth_checks.py): enable admin to enforce 'user' param for all openai endpoints
|
2024-03-27 17:36:27 -07:00 |
|
Krrish Dholakia
|
4da70d223e
|
feat(proxy_server.py): new /spend/calculate endpoint
Allows user to calculate spend before making the call
|
2024-03-27 17:08:55 -07:00 |
|
Krrish Dholakia
|
3df60e856c
|
fix(proxy_server.py): don't auto-create user when creating key
|
2024-03-27 16:48:57 -07:00 |
|