Commit graph

139 commits

Author SHA1 Message Date
Krrish Dholakia
ea7968e22e test(test_caching.py): skip local test 2024-08-21 15:05:18 -07:00
Krrish Dholakia
33c9c16388 feat(caching.py): redis cluster support
Closes https://github.com/BerriAI/litellm/issues/4358
2024-08-21 15:01:52 -07:00
Ishaan Jaff
b196f41d64 fix qdrant litellm on proxy 2024-08-21 12:52:29 -07:00
Ishaan Jaff
0d812b0d81 fix qdrant semantic caching test 2024-08-21 12:11:49 -07:00
Ishaan Jaff
0f3274b074 fix drant url 2024-08-21 12:09:09 -07:00
Ishaan Jaff
a34aeafdb5 Merge pull request #5018 from haadirakhangi/main
Qdrant Semantic Caching
2024-08-21 08:50:43 -07:00
Haadi Rakhangi
2e84b2fca6 added testing for qdrant semantic caching 2024-08-20 00:29:50 +05:30
Krrish Dholakia
0d82089136 test(test_caching.py): re-introduce testing for s3 cache w/ streaming
Closes https://github.com/BerriAI/litellm/issues/3268
2024-08-19 10:56:48 -07:00
Krrish Dholakia
ae51c99025 fix: cleanup test 2024-08-05 11:23:49 -07:00
Krrish Dholakia
d526a12080 fix(init.py): rename feature_flag 2024-08-05 11:23:20 -07:00
Krrish Dholakia
8500f6d087 feat(caching.py): enable caching on provider-specific optional params
Closes https://github.com/BerriAI/litellm/issues/5049
2024-08-05 11:18:59 -07:00
Krrish Dholakia
e91f6153c8 fix(litellm_logging.py): fix async caching for sync streaming calls (don't do it)
Checks if call is async before running async caching for streaming call

Fixes https://github.com/BerriAI/litellm/issues/4511#issuecomment-2233211808
2024-07-17 11:15:30 -07:00
Krrish Dholakia
a79cb33960 fix(caching.py): fix async redis health check 2024-07-06 09:14:29 -07:00
Krrish Dholakia
131fcac7d8 fix(_service_logging.py): only trigger otel if in service_callback
Fixes https://github.com/BerriAI/litellm/issues/4511
2024-07-03 09:48:38 -07:00
Antonio Loison
208d921988 test(test_caching.py): use mock_response in disk cache test 2024-05-10 11:00:18 +02:00
Antonio Loison
bca84d46b1 test(test_caching.py): add disk cache test when using completion 2024-05-10 10:03:38 +02:00
Krrish Dholakia
fea0e6bb19 fix(test_caching.py): add longer delay for async test 2024-04-23 16:13:03 -07:00
Krrish Dholakia
04014c752b fix(utils.py): fix 'no-cache': true when caching is turned on 2024-04-23 12:58:30 -07:00
Krish Dholakia
a9dc93e860 Merge branch 'main' into litellm_ssl_caching_fix 2024-04-19 17:20:27 -07:00
Krrish Dholakia
be0a30e196 test(test_caching.py): add sleep 2024-04-19 17:02:15 -07:00
Krrish Dholakia
5da934099f fix(caching.py): dual cache async_batch_get_cache fix + testing
this fixes a bug in usage-based-routing-v2 which was caused b/c of how the result was being returned from dual cache async_batch_get_cache. it also adds unit testing for that function (and it's sync equivalent)
2024-04-19 15:03:25 -07:00
Ishaan Jaff
966356721b fix - test caching atext_completion 2024-04-12 20:37:56 -07:00
Ishaan Jaff
204573f3f2 test - atext_completion + caching 2024-04-12 12:32:21 -07:00
Ishaan Jaff
4058327b43 test -base64 cache hits 2024-04-10 16:46:56 -07:00
Krrish Dholakia
075c96a408 fix(utils.py): fix reordering of items for cached embeddings
ensures cached embedding item is returned in correct order
2024-04-08 12:18:24 -07:00
Krrish Dholakia
9b8e1fe052 test(test_caching.py): skip test - aws suspended account
will need to recreate these objects on a new aws account
2024-04-04 15:07:19 -07:00
Krish Dholakia
d620d94134 Merge pull request #2692 from BerriAI/litellm_streaming_fixes
fix(utils.py): ensure last chunk is always empty delta w/ finish reason
2024-03-25 21:57:04 -07:00
Krrish Dholakia
eda65a25e3 test(test_caching.py): fix test_redis_cache_acompletion_stream 2024-03-25 21:36:47 -07:00
Ishaan Jaff
dd6dedd0aa (test) no cache hit 2024-03-25 18:56:36 -07:00
Ishaan Jaff
a0601723e5 (test) batch writing to cache 2024-03-25 18:04:04 -07:00
Krrish Dholakia
fec92767bb fix(caching.py): support default ttl for caching 2024-03-25 13:40:17 -07:00
Krrish Dholakia
dfcf16eb4d fix(caching.py): pass redis kwargs to connection pool init 2024-03-18 08:21:36 -07:00
Krrish Dholakia
45582d2fa5 test(test_caching.py): fix async tests 2024-03-15 18:09:25 -07:00
Krrish Dholakia
807bf854c3 test: reintegrate s3 testing 2024-03-07 08:56:59 -08:00
Krish Dholakia
d0dec7fc71 Merge pull request #2379 from BerriAI/litellm_s3_bucket_folder_path
fix(caching.py): add s3 path as a top-level param
2024-03-06 19:35:46 -08:00
Krrish Dholakia
12d663d693 fix(caching.py): add s3 path as a top-level param 2024-03-06 18:07:28 -08:00
Krrish Dholakia
ee2fc1e5e8 test(test_caching.py): fix test to check on id 2024-03-05 21:12:50 -08:00
Krrish Dholakia
dad65ca602 fix(bedrock.py): support anthropic messages api on bedrock (claude-3) 2024-03-04 17:15:47 -08:00
ishaan-jaff
f9bc8e2a76 (test) our AWS account is Suspended 2024-02-28 18:32:27 -08:00
Krrish Dholakia
129e84f0c5 test: removing aws tests - account suspended - pending their approval 2024-02-28 13:46:20 -08:00
Krrish Dholakia
0166c2f54b test: removing bedrock claude-v1 testing - bedrock removed this 2024-02-28 11:08:17 -08:00
ishaan-jaff
d211693ae9 (test) async s3 cache 2024-02-08 10:04:10 -08:00
ishaan-jaff
be53cbc45f (ci/cd) run again 2024-02-06 13:26:48 -08:00
ishaan-jaff
01f2ae4e54 (fix) mark semantic caching as beta test 2024-02-06 11:04:19 -08:00
ishaan-jaff
2df1872e5d (fix) semantic caching 2024-02-06 10:55:15 -08:00
ishaan-jaff
167b60ace7 (fix) test-semantic caching 2024-02-06 10:39:44 -08:00
ishaan-jaff
33f5ab8ba5 (test) async semantic cache 2024-02-06 08:14:54 -08:00
ishaan-jaff
6c6db6f1e6 (test) semantic caching 2024-02-05 18:22:50 -08:00
ishaan-jaff
3ee691b9d8 (test) semantic cache 2024-02-05 17:58:32 -08:00
ishaan-jaff
1689d5790f (feat )add semantic cache 2024-02-05 12:28:21 -08:00