Commit graph

926 commits

Author SHA1 Message Date
Krrish Dholakia
dd151869a3 fix(blocked_user_list.py): check if end user blocked in db 2024-03-16 13:03:52 -07:00
Krrish Dholakia
ef0002f31c fix(proxy_server.py): write blocked user list to a db table
lets this persist across workers
2024-03-16 12:26:29 -07:00
Krrish Dholakia
2c2db9ce89 fix(proxy_server.py): bug fix on getting user obj from cache 2024-03-16 11:07:38 -07:00
Krrish Dholakia
48f96dd563 fix(caching.py): close redis connection pool upon proxy shutdown 2024-03-16 10:39:58 -07:00
ishaan-jaff
177571bea3 (fix) backoff import 2024-03-16 08:53:06 -07:00
ishaan-jaff
2d7ca36ec4 (fix) errant print saying "Error importing .." msg 2024-03-15 21:47:52 -07:00
Krish Dholakia
4969ae0e9d
Merge pull request #2542 from BerriAI/litellm_redis_perf_improvements
89% Caching improvement - Async Redis completion calls + batch redis GET requests for a given key + call type
2024-03-15 18:58:36 -07:00
ishaan-jaff
fd93a68b0d (fix) fix error message raised on passing invalid tokens 2024-03-15 14:50:34 -07:00
Krrish Dholakia
226953e1d8 feat(batch_redis_get.py): batch redis GET requests for a given key + call type
reduces the number of GET requests we're making in high-throughput scenarios
2024-03-15 14:40:11 -07:00
Ishaan Jaff
31dcc6acf2
Merge pull request #2524 from BerriAI/litellm_fix_update_user
(fix) - update user error
2024-03-15 08:51:45 -07:00
ishaan-jaff
fa1676b253 (fix) error cli users see when importing enterprise folder 2024-03-15 08:10:45 -07:00
ishaan-jaff
634e0227f3 (fix) - update user error 2024-03-14 19:50:26 -07:00
ishaan-jaff
1b63748831 (temp) test build without using argon2 2024-03-14 18:53:16 -07:00
Krrish Dholakia
d6537a05ca fix(caching.py): fix print statements 2024-03-14 12:58:34 -07:00
Krrish Dholakia
704573c3f6 fix(proxy_server.py): improve error message on ui login error 2024-03-14 10:02:41 -07:00
ishaan-jaff
4006d10b7b (fix) importing PromptInjectionDetection 2024-03-13 21:24:37 -07:00
Krish Dholakia
d8eff53ebe
Merge pull request #2506 from BerriAI/litellm_update_db_perf_improvements
fix(proxy_server.py): move to using UPDATE + SET for track_cost_callback
2024-03-13 20:55:40 -07:00
Krrish Dholakia
1b807fa3f5 fix(proxy_server.py): fix key caching logic 2024-03-13 19:10:24 -07:00
Krrish Dholakia
acc672a78f fix(proxy_server.py): maintain support for model specific budgets 2024-03-13 17:04:51 -07:00
Krrish Dholakia
cf090acb25 fix(proxy_server.py): move to using UPDATE + SET for track_cost_callback 2024-03-13 16:13:37 -07:00
Ishaan Jaff
8a886c6e93
Merge pull request #2501 from BerriAI/litellm_fix_using_enterprise_docker
(fix) using enterprise folder on litellm Docker
2024-03-13 14:26:21 -07:00
Ishaan Jaff
d82be720d2
Merge pull request #2493 from BerriAI/litellm_return_429_no_models_available
[Proxy] return 429 when no models available
2024-03-13 13:33:43 -07:00
Krish Dholakia
3e66b50602
Merge pull request #2498 from BerriAI/litellm_prompt_injection_detection
feat(prompt_injection_detection.py): support simple heuristic similarity check for prompt injection attacks
2024-03-13 12:28:19 -07:00
ishaan-jaff
82246d8e30 (fix) using enterprise folder on litellm 2024-03-13 12:16:58 -07:00
Krrish Dholakia
234cdbbfef feat(prompt_injection_detection.py): support simple heuristic similarity check for prompt injection attacks 2024-03-13 10:32:21 -07:00
ishaan-jaff
771d09312e (fix) issue with using litellm enterprise license 2024-03-13 10:30:31 -07:00
ishaan-jaff
4c526ade27 (fix) errors fro litellm proxy 2024-03-13 08:05:32 -07:00
ishaan-jaff
3aeada232e (fix) return 429 error 2024-03-13 08:03:28 -07:00
Krish Dholakia
8beb6bb2bc
Merge pull request #2382 from BerriAI/litellm_cost_tracking_fix
fix(proxy_server.py): support cost tracking if general_settings is none
2024-03-12 19:15:23 -07:00
Krrish Dholakia
39f9bfad80 fix(proxy_server.py): cache master key check 2024-03-12 15:15:33 -07:00
ishaan-jaff
6f069d26f3 (fix) patch dynamoDB tea_model_alias bug 2024-03-12 11:17:12 -07:00
Krrish Dholakia
fd4086d5da fix(proxy_server.py): fix /user/info for non-existent user id 2024-03-12 11:05:41 -07:00
Krrish Dholakia
d1644db8ce test(test_key_generate_prisma.py): fix test to only let admin delete a key 2024-03-11 14:18:01 -07:00
Krrish Dholakia
2addd66393 fix(proxy_server.py): bug fix 2024-03-11 13:54:58 -07:00
Krrish Dholakia
4eb244c3ca fix(proxy_server.py): prevent user from deleting non-user owned keys when they use ui 2024-03-11 12:13:30 -07:00
Krrish Dholakia
4b64e506f4 fix(proxy_server.py): fix argon import 2024-03-11 11:33:34 -07:00
Krish Dholakia
c7d0af0a2e
Merge pull request #2426 from BerriAI/litellm_whisper_cost_tracking
feat: add cost tracking + caching for `/audio/transcription` calls
2024-03-09 19:12:06 -08:00
Krish Dholakia
c022568a3a
Merge branch 'main' into litellm_faster_api_key_checking 2024-03-09 18:45:03 -08:00
Krrish Dholakia
7a29fe9525 fix(proxy_server.py): check if master key is str before hashing 2024-03-09 16:51:11 -08:00
Krrish Dholakia
03f0c968f9 fix(proxy_server.py): fix argon exceptions 2024-03-09 16:16:40 -08:00
Krrish Dholakia
fa45c569fd feat: add cost tracking + caching for transcription calls 2024-03-09 15:43:38 -08:00
Krish Dholakia
caa99f43bf
Merge branch 'main' into litellm_load_balancing_transcription_endpoints 2024-03-08 23:08:47 -08:00
Krrish Dholakia
5ffbcf79d3 fix(proxy_server.py): fix argon cfi checking 2024-03-08 22:01:44 -08:00
Ishaan Jaff
8036b48f14
Merge pull request #2408 from BerriAI/litellm_no_store_reqs
[FEAT-liteLLM Proxy] Incognito Requests -  Don't log anything
2024-03-08 21:11:43 -08:00
Krrish Dholakia
0fb7afe820 feat(proxy_server.py): working /audio/transcription endpoint 2024-03-08 18:20:27 -08:00
ishaan-jaff
d6dc28f0ed (fix) proxy setting success callbacks 2024-03-08 16:27:53 -08:00
Krrish Dholakia
cc0294b2f2 fix(proxy_server.py): fix tagging of endpoints 2024-03-08 14:29:31 -08:00
Krrish Dholakia
8c6d5b7f16 feat(proxy_server.py): supports /audio/transcription endpoint on proxy 2024-03-08 14:28:07 -08:00
ishaan-jaff
2aafbe390b (feat) read passed api_version 2024-03-08 13:16:12 -08:00
Krrish Dholakia
0cf056f493 fix(proxy_server.py): use argon2 for faster api key checking
0.04s latency boost on load test
2024-03-07 21:48:18 -08:00