Commit graph

1364 commits

Author SHA1 Message Date
Krrish Dholakia
90e17b5422 fix(handle_jwt.py): track spend for user using jwt auth 2024-03-20 10:55:52 -07:00
Ishaan Jaff
4ed551dc52 (feat) better debugging for /cache/ping 2024-03-20 08:30:11 -07:00
Ishaan Jaff
2256ece5a9 (feat) litellm cache ping 2024-03-20 08:24:13 -07:00
Ishaan Jaff
8f750b71eb (fix) caching - don't require cache password 2024-03-19 20:50:16 -07:00
Krrish Dholakia
f25b03326b fix(proxy_server.py): allow user to disable scheduled reset budget task 2024-03-19 20:36:22 -07:00
Krrish Dholakia
2dfdc8dd69 Revert "Merge pull request #2593 from BerriAI/litellm_reset_budget_fix"
This reverts commit afd363129f, reversing
changes made to c94bc94ad5.
2024-03-19 20:25:41 -07:00
Krish Dholakia
afd363129f
Merge pull request #2593 from BerriAI/litellm_reset_budget_fix
fix(proxy/utils.py): fix reset budget logic
2024-03-19 20:17:03 -07:00
Krrish Dholakia
37795c0d92 fix(proxy_server.py): add more debug logs 2024-03-19 19:59:43 -07:00
Krrish Dholakia
f6de3a0359 fix: better debug logs 2024-03-19 19:28:26 -07:00
Ishaan Jaff
c94bc94ad5
Merge pull request #2591 from BerriAI/litellm_metrics_endpoint
[Feat] /metrics endpoint for Prometheus, Grafana
2024-03-19 18:08:22 -07:00
Ishaan Jaff
aa1c480452 (feat) using prom litellm 2024-03-19 15:49:23 -07:00
Krrish Dholakia
302bab6f1f feat(handle_jwt.py): support authenticating admins into the proxy via jwt's 2024-03-19 15:00:27 -07:00
Ishaan Jaff
4b7e102187 (v0) prometheus metric 2024-03-19 14:48:38 -07:00
Krrish Dholakia
7c74a0e6e2 fix(proxy_server.py): expose disable_spend_logs flag in config general settings
Writing each spend log adds +300ms latency

https://github.com/BerriAI/litellm/issues/1714#issuecomment-1924727281
2024-03-19 12:08:37 -07:00
Krish Dholakia
c4dbd0407e
Merge pull request #2561 from BerriAI/litellm_batch_writing_db
fix(proxy/utils.py): move to batch writing db updates
2024-03-18 21:50:47 -07:00
Krrish Dholakia
7eaddaef10 refactor(proxy_server.py): re-add custom db client logic - prevent regressions 2024-03-18 21:16:28 -07:00
Ishaan Jaff
51d658e878 (fix) if litellm-proxy-budget set use it 2024-03-18 20:31:23 -07:00
Krrish Dholakia
f588bff69b fix(proxy_server.py): fix spend log update 2024-03-18 20:26:28 -07:00
Ishaan Jaff
87dd3f1235 (fix) show global spend on UI 2024-03-18 18:15:08 -07:00
Krrish Dholakia
8fefe625d9 fix(proxy/utils.py): batch writing updates to db 2024-03-18 16:47:02 -07:00
Krrish Dholakia
f0434350f1 fix(proxy_server.py): don't override cache params on proxy config if set 2024-03-18 12:11:30 -07:00
Krrish Dholakia
8619499853 fix(proxy_server.py): ignore cache if value is false 2024-03-18 11:15:32 -07:00
Ikko Eltociear Ashimine
0b561764b0
Update proxy_server.py
intialize -> initialize
2024-03-18 01:28:12 +09:00
Krrish Dholakia
077b9c6234 fix(proxy/utils.py): move to batch writing db updates 2024-03-16 22:32:00 -07:00
Ishaan Jaff
4e33aac997
Merge pull request #2560 from BerriAI/litellm_view_team_based_spend
Admin UI - view team based spend
2024-03-16 19:38:17 -07:00
Ishaan Jaff
0d349d389a (feat) new ui build 2024-03-16 19:35:00 -07:00
Ishaan Jaff
902337e28a (feat) view team based spend 2024-03-16 19:06:16 -07:00
Krish Dholakia
e55a8c3570
Merge pull request #2556 from BerriAI/litellm_aws_secret_manager_support
fix(utils.py): initial commit for aws secret manager support
2024-03-16 18:41:58 -07:00
Krrish Dholakia
bc66ef9d5c fix(utils.py): fix aws secret manager + support key_management_settings
fixes the aws secret manager implementation and allows the user to set which keys they want to check through it
2024-03-16 16:47:50 -07:00
Ishaan Jaff
ac6c69ff89
Merge pull request #2559 from BerriAI/litellm_show_spend_correctly
(fix) admin ui - order spend by date
2024-03-16 16:43:13 -07:00
Ishaan Jaff
e3582d86c6
Merge pull request #2557 from BerriAI/litellm_clean_up_health_readiness
(fix) /health/readiness return success callback names as (str)
2024-03-16 16:38:01 -07:00
Ishaan Jaff
2355e9dc51 (fix) admin ui - order spend by date 2024-03-16 16:15:10 -07:00
Ishaan Jaff
cb4c36b7f6 (fix) /health readiness return callback names 2024-03-16 15:56:07 -07:00
Ishaan Jaff
732ab69d32
Merge pull request #2553 from BerriAI/litellm_improve_health_readiness_perf
(feat) Proxy - improve health readiness perf (93% faster)
2024-03-16 15:38:45 -07:00
Ishaan Jaff
389e79d3b2 (fix) improve /health/readiness under high load 2024-03-16 14:14:17 -07:00
Krrish Dholakia
3de766c071 fix(proxy_server.py): fix linting errors 2024-03-16 14:02:36 -07:00
Krrish Dholakia
844d6828c8 fix(proxy_server.py): rename beta end user blocking endpoints
separating end user management from user management
2024-03-16 13:53:10 -07:00
Ishaan Jaff
61f129b9ab (feat) improve health readiness perf 2024-03-16 13:10:04 -07:00
Krrish Dholakia
dd151869a3 fix(blocked_user_list.py): check if end user blocked in db 2024-03-16 13:03:52 -07:00
Krrish Dholakia
ef0002f31c fix(proxy_server.py): write blocked user list to a db table
lets this persist across workers
2024-03-16 12:26:29 -07:00
Krrish Dholakia
2c2db9ce89 fix(proxy_server.py): bug fix on getting user obj from cache 2024-03-16 11:07:38 -07:00
Krrish Dholakia
48f96dd563 fix(caching.py): close redis connection pool upon proxy shutdown 2024-03-16 10:39:58 -07:00
ishaan-jaff
177571bea3 (fix) backoff import 2024-03-16 08:53:06 -07:00
ishaan-jaff
2d7ca36ec4 (fix) errant print saying "Error importing .." msg 2024-03-15 21:47:52 -07:00
Krish Dholakia
4969ae0e9d
Merge pull request #2542 from BerriAI/litellm_redis_perf_improvements
89% Caching improvement - Async Redis completion calls + batch redis GET requests for a given key + call type
2024-03-15 18:58:36 -07:00
ishaan-jaff
fd93a68b0d (fix) fix error message raised on passing invalid tokens 2024-03-15 14:50:34 -07:00
Krrish Dholakia
226953e1d8 feat(batch_redis_get.py): batch redis GET requests for a given key + call type
reduces the number of GET requests we're making in high-throughput scenarios
2024-03-15 14:40:11 -07:00
Ishaan Jaff
31dcc6acf2
Merge pull request #2524 from BerriAI/litellm_fix_update_user
(fix) - update user error
2024-03-15 08:51:45 -07:00
ishaan-jaff
fa1676b253 (fix) error cli users see when importing enterprise folder 2024-03-15 08:10:45 -07:00
ishaan-jaff
634e0227f3 (fix) - update user error 2024-03-14 19:50:26 -07:00