Commit graph

951 commits

Author SHA1 Message Date
Ishaan Jaff
3f28e932d9 (v0) prometheus metric 2024-03-19 14:48:38 -07:00
Krrish Dholakia
53ed290ed3 fix(proxy_server.py): expose disable_spend_logs flag in config general settings
Writing each spend log adds +300ms latency

https://github.com/BerriAI/litellm/issues/1714#issuecomment-1924727281
2024-03-19 12:08:37 -07:00
Krish Dholakia
f522a5236b Merge pull request #2561 from BerriAI/litellm_batch_writing_db
fix(proxy/utils.py): move to batch writing db updates
2024-03-18 21:50:47 -07:00
Krrish Dholakia
e20ee870d2 refactor(proxy_server.py): re-add custom db client logic - prevent regressions 2024-03-18 21:16:28 -07:00
Ishaan Jaff
1a3a19e195 (fix) if litellm-proxy-budget set use it 2024-03-18 20:31:23 -07:00
Krrish Dholakia
6c578db2c2 fix(proxy_server.py): fix spend log update 2024-03-18 20:26:28 -07:00
Ishaan Jaff
d2f3813e54 (fix) show global spend on UI 2024-03-18 18:15:08 -07:00
Krrish Dholakia
1b10123f07 fix(proxy/utils.py): batch writing updates to db 2024-03-18 16:47:02 -07:00
Krrish Dholakia
7f74cf0057 fix(proxy_server.py): don't override cache params on proxy config if set 2024-03-18 12:11:30 -07:00
Krrish Dholakia
23d088a1da fix(proxy_server.py): ignore cache if value is false 2024-03-18 11:15:32 -07:00
Krrish Dholakia
1618751824 fix(proxy/utils.py): move to batch writing db updates 2024-03-16 22:32:00 -07:00
Ishaan Jaff
0da6e78cb7 Merge pull request #2560 from BerriAI/litellm_view_team_based_spend
Admin UI - view team based spend
2024-03-16 19:38:17 -07:00
Ishaan Jaff
e6bebfda8f (feat) new ui build 2024-03-16 19:35:00 -07:00
Ishaan Jaff
12c68060c9 (feat) view team based spend 2024-03-16 19:06:16 -07:00
Krish Dholakia
293c32204e Merge pull request #2556 from BerriAI/litellm_aws_secret_manager_support
fix(utils.py): initial commit for aws secret manager support
2024-03-16 18:41:58 -07:00
Krrish Dholakia
bad2327b88 fix(utils.py): fix aws secret manager + support key_management_settings
fixes the aws secret manager implementation and allows the user to set which keys they want to check thr
ough it
2024-03-16 16:47:50 -07:00
Ishaan Jaff
3cc579dcbb Merge pull request #2559 from BerriAI/litellm_show_spend_correctly
(fix) admin ui - order spend by date
2024-03-16 16:43:13 -07:00
Ishaan Jaff
6253b45517 Merge pull request #2557 from BerriAI/litellm_clean_up_health_readiness
(fix) /health/readiness return success callback names as (str)
2024-03-16 16:38:01 -07:00
Ishaan Jaff
d6d9d69618 (fix) admin ui - order spend by date 2024-03-16 16:15:10 -07:00
Ishaan Jaff
e11e42a0ec (fix) /health readiness return callback names 2024-03-16 15:56:07 -07:00
Ishaan Jaff
dd17c5bd1a Merge pull request #2553 from BerriAI/litellm_improve_health_readiness_perf
(feat) Proxy - improve health readiness perf (93% faster)
2024-03-16 15:38:45 -07:00
Ishaan Jaff
e71a5b4d24 (fix) improve /health/readiness under high load 2024-03-16 14:14:17 -07:00
Krrish Dholakia
e37c97607b fix(proxy_server.py): fix linting errors 2024-03-16 14:02:36 -07:00
Krrish Dholakia
928ea59fac fix(proxy_server.py): rename beta end user blocking endpoints
separating end user management from user management
2024-03-16 13:53:10 -07:00
Ishaan Jaff
f6418b9b34 (feat) improve health readiness perf 2024-03-16 13:10:04 -07:00
Krrish Dholakia
be33918438 fix(blocked_user_list.py): check if end user blocked in db 2024-03-16 13:03:52 -07:00
Krrish Dholakia
588b252cab fix(proxy_server.py): write blocked user list to a db table
lets this persist across workers
2024-03-16 12:26:29 -07:00
Krrish Dholakia
8ebf4a62dc fix(proxy_server.py): bug fix on getting user obj from cache 2024-03-16 11:07:38 -07:00
Krrish Dholakia
27de1089a6 fix(caching.py): close redis connection pool upon proxy shutdown 2024-03-16 10:39:58 -07:00
ishaan-jaff
27368f9500 (fix) backoff import 2024-03-16 08:53:06 -07:00
ishaan-jaff
f71ea87cf1 (fix) errant print saying "Error importing .." msg 2024-03-15 21:47:52 -07:00
Krish Dholakia
8e6e7f880d Merge pull request #2542 from BerriAI/litellm_redis_perf_improvements
89% Caching improvement - Async Redis completion calls + batch redis GET requests for a given key + call type
2024-03-15 18:58:36 -07:00
ishaan-jaff
37be2ba569 (fix) fix error message raised on passing invalid tokens 2024-03-15 14:50:34 -07:00
Krrish Dholakia
8d1c60bfdc feat(batch_redis_get.py): batch redis GET requests for a given key + call type
reduces the number of GET requests we're making in high-throughput scenarios
2024-03-15 14:40:11 -07:00
Ishaan Jaff
607292596b Merge pull request #2524 from BerriAI/litellm_fix_update_user
(fix) - update user error
2024-03-15 08:51:45 -07:00
ishaan-jaff
2893d289a0 (fix) error cli users see when importing enterprise folder 2024-03-15 08:10:45 -07:00
ishaan-jaff
084fdf5cb9 (fix) - update user error 2024-03-14 19:50:26 -07:00
ishaan-jaff
fe182d2115 (temp) test build without using argon2 2024-03-14 18:53:16 -07:00
Krrish Dholakia
8d4b7b60bf fix(caching.py): fix print statements 2024-03-14 12:58:34 -07:00
Krrish Dholakia
0f1ff55a64 fix(proxy_server.py): improve error message on ui login error 2024-03-14 10:02:41 -07:00
ishaan-jaff
c8fbadeec2 (fix) importing PromptInjectionDetection 2024-03-13 21:24:37 -07:00
Krish Dholakia
8436507616 Merge pull request #2506 from BerriAI/litellm_update_db_perf_improvements
fix(proxy_server.py): move to using UPDATE + SET for track_cost_callback
2024-03-13 20:55:40 -07:00
Krrish Dholakia
3232feb123 fix(proxy_server.py): fix key caching logic 2024-03-13 19:10:24 -07:00
Krrish Dholakia
8f6af575e7 fix(proxy_server.py): maintain support for model specific budgets 2024-03-13 17:04:51 -07:00
Krrish Dholakia
1a6154ec74 fix(proxy_server.py): move to using UPDATE + SET for track_cost_callback 2024-03-13 16:13:37 -07:00
Ishaan Jaff
94553ca0b0 Merge pull request #2501 from BerriAI/litellm_fix_using_enterprise_docker
(fix) using enterprise folder on litellm Docker
2024-03-13 14:26:21 -07:00
Ishaan Jaff
a7a74047a9 Merge pull request #2493 from BerriAI/litellm_return_429_no_models_available
[Proxy] return 429 when no models available
2024-03-13 13:33:43 -07:00
Krish Dholakia
42b3577fd0 Merge pull request #2498 from BerriAI/litellm_prompt_injection_detection
feat(prompt_injection_detection.py): support simple heuristic similarity check for prompt injection attacks
2024-03-13 12:28:19 -07:00
ishaan-jaff
ede01e9c8e (fix) using enterprise folder on litellm 2024-03-13 12:16:58 -07:00
Krrish Dholakia
5769bd22c3 feat(prompt_injection_detection.py): support simple heuristic similarity check for prompt injection attacks 2024-03-13 10:32:21 -07:00