Commit graph

3137 commits

Author SHA1 Message Date
Krrish Dholakia
f08bb7e41f fix(utils.py): exclude s3 caching from individual item caching for embedding list
can't bulk upload to s3, so this will slow down calls

https://github.com/BerriAI/litellm/pull/1417
2024-01-13 16:19:30 +05:30
Krrish Dholakia
0bcca3fed3 refactor(main.py): trigger rebuild 2024-01-13 15:55:56 +05:30
Krrish Dholakia
79cc739b53 fix(caching.py): fix async in-memory caching 2024-01-13 15:33:57 +05:30
Krrish Dholakia
cdadac1649 fix(caching.py): return updated kwargs from get_cache helper function 2024-01-13 15:04:34 +05:30
Krrish Dholakia
9e6451e8f9 fix(conftest.py): create an event loop if one isn't made 2024-01-13 14:23:04 +05:30
Krrish Dholakia
d41b84ec40 test(conftest.py): create an event loop if one doesn't exist 2024-01-13 14:21:24 +05:30
Krrish Dholakia
0182dee42b fix(caching.py): remove print verbose statement 2024-01-13 14:11:05 +05:30
Krrish Dholakia
880f829013 fix(caching.py): use bulk writes and blockconnectionpooling for reads from Redis 2024-01-13 11:50:50 +05:30
Krrish Dholakia
813fb19620 fix: support async redis caching 2024-01-12 21:46:41 +05:30
Krish Dholakia
7ecfc09221 Merge branch 'main' into litellm_embedding_caching_updates 2024-01-11 23:58:51 +05:30
ishaan-jaff
eb8b00b2a5 (test) caching for bedrock/embedding str inputs 2024-01-11 23:12:57 +05:30
ishaan-jaff
b4665c80e5 (test) bedrock - embedding with strings 2024-01-11 23:04:41 +05:30
ishaan-jaff
1f04446222 (fix) bedrock - embedding - support str input 2024-01-11 23:02:12 +05:30
ishaan-jaff
9aac1de191 v0 2024-01-11 22:56:18 +05:30
Ishaan Jaff
8b0f93dd2d Merge pull request #1422 from dleen/httpx
(fix) create httpx.Request instead of httpx.request
2024-01-11 22:31:55 +05:30
David Leen
3cd5ef9a02 (fix) create httpx.Request instead of httpx.request
fixes #1420
2024-01-11 16:22:26 +01:00
Krrish Dholakia
107ba5aece fix(utils.py): use preset cache key for async calls as well 2024-01-11 19:30:28 +05:30
Krrish Dholakia
36068b707a fix(proxy_cli.py): read db url from config, not just environment 2024-01-11 19:19:29 +05:30
Krrish Dholakia
4ed9f2b712 fix(utils.py): bug fixes 2024-01-11 19:02:17 +05:30
Krrish Dholakia
d0e3cd7e13 test(test_custom_callback_input.py): make test more verbose
n
2024-01-11 18:44:58 +05:30
Krrish Dholakia
f3b7e98da7 fix(main.py): init custom llm provider earlier 2024-01-11 18:30:10 +05:30
Krrish Dholakia
63298d6c75 fix(utils.py): bug fixes 2024-01-11 18:14:22 +05:30
ishaan-jaff
50aeec537c (feat) improve litellm verbose logs 2024-01-11 18:13:08 +05:30
ishaan-jaff
859eb0d98e (feat) show args passed to litellm.completion, acompletion on call 2024-01-11 17:56:27 +05:30
ishaan-jaff
14c0e87e34 (test) improve s3 logging test 2024-01-11 16:57:51 +05:30
Krrish Dholakia
4de82617c0 fix(main.py): add back **kwargs for acompletion 2024-01-11 16:55:19 +05:30
Krrish Dholakia
66addb1a01 fix(utils.py): support caching individual items in embedding input list
https://github.com/BerriAI/litellm/issues/1350
2024-01-11 16:51:34 +05:30
Ishaan Jaff
75b987ffee Merge pull request #1413 from BerriAI/litellm_log_cache_hits
[Feat] Proxy - Log Cache Hits on success callbacks + Testing
2024-01-11 16:39:22 +05:30
Krrish Dholakia
1472dc3f54 fix: n 2024-01-11 16:30:05 +05:30
ishaan-jaff
0da0909f2d (feat) s3 logging - log cache hits 2024-01-11 15:57:54 +05:30
ishaan-jaff
fccd81a444 (fix) retry gemini-pro-vision 3 times 2024-01-11 14:39:08 +05:30
ishaan-jaff
c41b47dc8b (fix) acompletion kwargs type hints 2024-01-11 14:22:37 +05:30
Krish Dholakia
300237d677 Merge pull request #1415 from BerriAI/litellm_bump_httpx_pool_limits
fix(router.py): bump httpx pool limits
2024-01-11 13:46:31 +05:30
Krrish Dholakia
da2af9f45e fix(router.py): bump httpx pool limits 2024-01-11 12:51:29 +05:30
ishaan-jaff
29393fb512 (fix) acompletion typehints - pass kwargs 2024-01-11 11:49:55 +05:30
ishaan-jaff
a645e982ee (test) s3 log cache hits 2024-01-11 11:44:48 +05:30
ishaan-jaff
ae8b6b31e2 (fix) s3 log cache hits 2024-01-11 11:44:20 +05:30
ishaan-jaff
cea0d6c8b0 (fix) litellm.acompletion with type hints 2024-01-11 10:47:12 +05:30
Ishaan Jaff
6e1be43595 Merge pull request #1200 from MateoCamara/explicit-args-acomplete
feat: added explicit args to acomplete
2024-01-11 10:39:05 +05:30
ishaan-jaff
4a47b17ba2 (feat) proxy - support s3_callback_params 2024-01-11 09:57:47 +05:30
ishaan-jaff
1a3a6408ac (test) s3 logs for /chat/completions 2024-01-11 09:16:06 +05:30
ishaan-jaff
3f907b9835 (test) s3 logging 2024-01-11 08:58:03 +05:30
ishaan-jaff
426ce91481 (fix) working s3 logging 2024-01-11 08:57:32 +05:30
ishaan-jaff
a7fe37599e v0 2024-01-11 08:25:40 +05:30
Krrish Dholakia
3ad4a44803 test(test_tpm_rpm_routing.py): add more logging for the test 2024-01-11 00:43:14 +05:30
Krrish Dholakia
253842bc7a test(test_router.py): handle rate limiting error 2024-01-11 00:00:17 +05:30
Krrish Dholakia
e71154f286 fix(main.py): fix streaming completion token counting error 2024-01-10 23:44:35 +05:30
Krrish Dholakia
3ed296e2dd fix(utils.py): raise correct error for azure content blocked error 2024-01-10 23:31:51 +05:30
Krrish Dholakia
568f4ae461 test(test_lowest_latency_routing.py): use the correct cache key 2024-01-10 22:15:01 +05:30
Krrish Dholakia
3028d0f622 feat(proxy_cli.py): move print statements to show actually deployed port 2024-01-10 22:09:58 +05:30