Krrish Dholakia
|
f08bb7e41f
|
fix(utils.py): exclude s3 caching from individual item caching for embedding list
can't bulk upload to s3, so this will slow down calls
https://github.com/BerriAI/litellm/pull/1417
|
2024-01-13 16:19:30 +05:30 |
|
Krrish Dholakia
|
0bcca3fed3
|
refactor(main.py): trigger rebuild
|
2024-01-13 15:55:56 +05:30 |
|
Krrish Dholakia
|
79cc739b53
|
fix(caching.py): fix async in-memory caching
|
2024-01-13 15:33:57 +05:30 |
|
Krrish Dholakia
|
cdadac1649
|
fix(caching.py): return updated kwargs from get_cache helper function
|
2024-01-13 15:04:34 +05:30 |
|
Krrish Dholakia
|
9e6451e8f9
|
fix(conftest.py): create an event loop if one isn't made
|
2024-01-13 14:23:04 +05:30 |
|
Krrish Dholakia
|
d41b84ec40
|
test(conftest.py): create an event loop if one doesn't exist
|
2024-01-13 14:21:24 +05:30 |
|
Krrish Dholakia
|
0182dee42b
|
fix(caching.py): remove print verbose statement
|
2024-01-13 14:11:05 +05:30 |
|
Krrish Dholakia
|
880f829013
|
fix(caching.py): use bulk writes and blockconnectionpooling for reads from Redis
|
2024-01-13 11:50:50 +05:30 |
|
Krrish Dholakia
|
813fb19620
|
fix: support async redis caching
|
2024-01-12 21:46:41 +05:30 |
|
Krish Dholakia
|
7ecfc09221
|
Merge branch 'main' into litellm_embedding_caching_updates
|
2024-01-11 23:58:51 +05:30 |
|
ishaan-jaff
|
eb8b00b2a5
|
(test) caching for bedrock/embedding str inputs
|
2024-01-11 23:12:57 +05:30 |
|
ishaan-jaff
|
b4665c80e5
|
(test) bedrock - embedding with strings
|
2024-01-11 23:04:41 +05:30 |
|
ishaan-jaff
|
1f04446222
|
(fix) bedrock - embedding - support str input
|
2024-01-11 23:02:12 +05:30 |
|
ishaan-jaff
|
9aac1de191
|
v0
|
2024-01-11 22:56:18 +05:30 |
|
Ishaan Jaff
|
8b0f93dd2d
|
Merge pull request #1422 from dleen/httpx
(fix) create httpx.Request instead of httpx.request
|
2024-01-11 22:31:55 +05:30 |
|
David Leen
|
3cd5ef9a02
|
(fix) create httpx.Request instead of httpx.request
fixes #1420
|
2024-01-11 16:22:26 +01:00 |
|
Krrish Dholakia
|
107ba5aece
|
fix(utils.py): use preset cache key for async calls as well
|
2024-01-11 19:30:28 +05:30 |
|
Krrish Dholakia
|
36068b707a
|
fix(proxy_cli.py): read db url from config, not just environment
|
2024-01-11 19:19:29 +05:30 |
|
Krrish Dholakia
|
4ed9f2b712
|
fix(utils.py): bug fixes
|
2024-01-11 19:02:17 +05:30 |
|
Krrish Dholakia
|
d0e3cd7e13
|
test(test_custom_callback_input.py): make test more verbsoe
n
|
2024-01-11 18:44:58 +05:30 |
|
Krrish Dholakia
|
f3b7e98da7
|
fix(main.py): init custom llm provider earlier
|
2024-01-11 18:30:10 +05:30 |
|
Krrish Dholakia
|
63298d6c75
|
fix(utils.py): bug fixes
|
2024-01-11 18:14:22 +05:30 |
|
ishaan-jaff
|
50aeec537c
|
(feat) improve litellm verbose logs
|
2024-01-11 18:13:08 +05:30 |
|
ishaan-jaff
|
859eb0d98e
|
(feat) show args passed to litellm.completion, acompletion on call
|
2024-01-11 17:56:27 +05:30 |
|
ishaan-jaff
|
14c0e87e34
|
(test) improve s3 logging test
|
2024-01-11 16:57:51 +05:30 |
|
Krrish Dholakia
|
4de82617c0
|
fix(main.py): add back **kwargs for acompletion
|
2024-01-11 16:55:19 +05:30 |
|
Krrish Dholakia
|
66addb1a01
|
fix(utils.py): support caching individual items in embedding input list
https://github.com/BerriAI/litellm/issues/1350
|
2024-01-11 16:51:34 +05:30 |
|
Ishaan Jaff
|
75b987ffee
|
Merge pull request #1413 from BerriAI/litellm_log_cache_hits
[Feat] Proxy - Log Cache Hits on success callbacks + Testing
|
2024-01-11 16:39:22 +05:30 |
|
Krrish Dholakia
|
1472dc3f54
|
fix: n
|
2024-01-11 16:30:05 +05:30 |
|
ishaan-jaff
|
0da0909f2d
|
(feat) s3 logging - log cache hits
|
2024-01-11 15:57:54 +05:30 |
|
ishaan-jaff
|
fccd81a444
|
(fix) retry gemini-pro-vision 3 times
|
2024-01-11 14:39:08 +05:30 |
|
ishaan-jaff
|
c41b47dc8b
|
(fix) acompletion kwargs type hints
|
2024-01-11 14:22:37 +05:30 |
|
Krish Dholakia
|
300237d677
|
Merge pull request #1415 from BerriAI/litellm_bump_httpx_pool_limits
fix(router.py): bump httpx pool limits
|
2024-01-11 13:46:31 +05:30 |
|
Krrish Dholakia
|
da2af9f45e
|
fix(router.py): bump httpx pool limits
|
2024-01-11 12:51:29 +05:30 |
|
ishaan-jaff
|
29393fb512
|
(fix) acompletion typehints - pass kwargs
|
2024-01-11 11:49:55 +05:30 |
|
ishaan-jaff
|
a645e982ee
|
(test) s3 log cache hits
|
2024-01-11 11:44:48 +05:30 |
|
ishaan-jaff
|
ae8b6b31e2
|
(fix) s3 log cache hits
|
2024-01-11 11:44:20 +05:30 |
|
ishaan-jaff
|
cea0d6c8b0
|
(fix) litellm.acompletion with type hints
|
2024-01-11 10:47:12 +05:30 |
|
Ishaan Jaff
|
6e1be43595
|
Merge pull request #1200 from MateoCamara/explicit-args-acomplete
feat: added explicit args to acomplete
|
2024-01-11 10:39:05 +05:30 |
|
ishaan-jaff
|
4a47b17ba2
|
(feat) proxy - support s3_callback_params
|
2024-01-11 09:57:47 +05:30 |
|
ishaan-jaff
|
1a3a6408ac
|
(test) s3 logs for /chat/completions
|
2024-01-11 09:16:06 +05:30 |
|
ishaan-jaff
|
3f907b9835
|
(test) s3 logging
|
2024-01-11 08:58:03 +05:30 |
|
ishaan-jaff
|
426ce91481
|
(fix) working s3 logging
|
2024-01-11 08:57:32 +05:30 |
|
ishaan-jaff
|
a7fe37599e
|
v0
|
2024-01-11 08:25:40 +05:30 |
|
Krrish Dholakia
|
3ad4a44803
|
test(test_tpm_rpm_routing.py): add more logging for the test
|
2024-01-11 00:43:14 +05:30 |
|
Krrish Dholakia
|
253842bc7a
|
test(test_router.py): handle rate limiting error
|
2024-01-11 00:00:17 +05:30 |
|
Krrish Dholakia
|
e71154f286
|
fix(main.py): fix streaming completion token counting error
|
2024-01-10 23:44:35 +05:30 |
|
Krrish Dholakia
|
3ed296e2dd
|
fix(utils.py): raise correct error for azure content blocked error
|
2024-01-10 23:31:51 +05:30 |
|
Krrish Dholakia
|
568f4ae461
|
test(test_lowest_latency_routing.py): use the correct cache key
|
2024-01-10 22:15:01 +05:30 |
|
Krrish Dholakia
|
3028d0f622
|
feat(proxy_cli.py): move print statements to show actually deployed port
|
2024-01-10 22:09:58 +05:30 |
|