Ishaan Jaff
01adc45f0f
test caching default on /off
2024-08-24 09:06:59 -07:00
Krrish Dholakia
f24075bcaf
test(test_caching.py): skip local test
2024-08-21 15:05:18 -07:00
Krrish Dholakia
e2d7539690
feat(caching.py): redis cluster support
...
Closes https://github.com/BerriAI/litellm/issues/4358
2024-08-21 15:01:52 -07:00
Ishaan Jaff
e7ecb2fe3a
fix qdrant litellm on proxy
2024-08-21 12:52:29 -07:00
Ishaan Jaff
228a0bd6f7
fix qdrant semantic caching test
2024-08-21 12:11:49 -07:00
Ishaan Jaff
428a74be07
fix drant url
2024-08-21 12:09:09 -07:00
Ishaan Jaff
7d0196191f
Merge pull request #5018 from haadirakhangi/main
...
Qdrant Semantic Caching
2024-08-21 08:50:43 -07:00
Haadi Rakhangi
df5074da56
added testing for qdrant semantic caching
2024-08-20 00:29:50 +05:30
Krrish Dholakia
3cafebbc65
test(test_caching.py): re-introduce testing for s3 cache w/ streaming
...
Closes https://github.com/BerriAI/litellm/issues/3268
2024-08-19 10:56:48 -07:00
Krrish Dholakia
aef25d5d00
fix: cleanup test
2024-08-05 11:23:49 -07:00
Krrish Dholakia
a9fdfb5a99
fix(init.py): rename feature_flag
2024-08-05 11:23:20 -07:00
Krrish Dholakia
3c4c78a71f
feat(caching.py): enable caching on provider-specific optional params
...
Closes https://github.com/BerriAI/litellm/issues/5049
2024-08-05 11:18:59 -07:00
Krrish Dholakia
d2e64f21f3
fix(litellm_logging.py): fix async caching for sync streaming calls (don't do it)
...
Checks if call is async before running async caching for streaming call
Fixes https://github.com/BerriAI/litellm/issues/4511#issuecomment-2233211808
2024-07-17 11:15:30 -07:00
Krrish Dholakia
3f83e8a8d4
fix(caching.py): fix async redis health check
2024-07-06 09:14:29 -07:00
Krrish Dholakia
606d04b05b
fix(_service_logging.py): only trigger otel if in service_callback
...
Fixes https://github.com/BerriAI/litellm/issues/4511
2024-07-03 09:48:38 -07:00
Antonio Loison
7ee07cd961
test(test_caching.py): use mock_response in disk cache test
2024-05-10 11:00:18 +02:00
Antonio Loison
ac27f431a4
test(test_caching.py): add disk cache test when using completion
2024-05-10 10:03:38 +02:00
Krrish Dholakia
d67e47d7fd
fix(test_caching.py): add longer delay for async test
2024-04-23 16:13:03 -07:00
Krrish Dholakia
161e836427
fix(utils.py): fix 'no-cache': true when caching is turned on
2024-04-23 12:58:30 -07:00
Krish Dholakia
6d9f0f1839
Merge branch 'main' into litellm_ssl_caching_fix
2024-04-19 17:20:27 -07:00
Krrish Dholakia
978c1a1976
test(test_caching.py): add sleep
2024-04-19 17:02:15 -07:00
Krrish Dholakia
01a1a8f731
fix(caching.py): dual cache async_batch_get_cache fix + testing
...
this fixes a bug in usage-based-routing-v2 which was caused b/c of how the result was being returned from dual cache async_batch_get_cache. it also adds unit testing for that function (and it's sync equivalent)
2024-04-19 15:03:25 -07:00
Ishaan Jaff
f617f5ebb5
fix - test caching atext_completion
2024-04-12 20:37:56 -07:00
Ishaan Jaff
11cd1ec6cf
test - atext_completion + caching
2024-04-12 12:32:21 -07:00
Ishaan Jaff
8bc02b34c2
test -base64 cache hits
2024-04-10 16:46:56 -07:00
Krrish Dholakia
48bfc45cb0
fix(utils.py): fix reordering of items for cached embeddings
...
ensures cached embedding item is returned in correct order
2024-04-08 12:18:24 -07:00
Krrish Dholakia
2472311a3f
test(test_caching.py): skip test - aws suspended account
...
will need to recreate these objects on a new aws account
2024-04-04 15:07:19 -07:00
Krish Dholakia
eb859f5dba
Merge pull request #2692 from BerriAI/litellm_streaming_fixes
...
fix(utils.py): ensure last chunk is always empty delta w/ finish reason
2024-03-25 21:57:04 -07:00
Krrish Dholakia
643fd6ac96
test(test_caching.py): fix test_redis_cache_acompletion_stream
2024-03-25 21:36:47 -07:00
Ishaan Jaff
07fe08d8b5
(test) no cache hit
2024-03-25 18:56:36 -07:00
Ishaan Jaff
3fcab0137a
(test) batch writing to cache
2024-03-25 18:04:04 -07:00
Krrish Dholakia
591a0a376e
fix(caching.py): support default ttl for caching
2024-03-25 13:40:17 -07:00
Krrish Dholakia
03acc07380
fix(caching.py): pass redis kwargs to connection pool init
2024-03-18 08:21:36 -07:00
Krrish Dholakia
3072137739
test(test_caching.py): fix async tests
2024-03-15 18:09:25 -07:00
Krrish Dholakia
4885bff9e3
test: reintegrate s3 testing
2024-03-07 08:56:59 -08:00
Krish Dholakia
06bde2b8c0
Merge pull request #2379 from BerriAI/litellm_s3_bucket_folder_path
...
fix(caching.py): add s3 path as a top-level param
2024-03-06 19:35:46 -08:00
Krrish Dholakia
726dad5756
fix(caching.py): add s3 path as a top-level param
2024-03-06 18:07:28 -08:00
Krrish Dholakia
8a4a14cc95
test(test_caching.py): fix test to check on id
2024-03-05 21:12:50 -08:00
Krrish Dholakia
478307d4cf
fix(bedrock.py): support anthropic messages api on bedrock (claude-3)
2024-03-04 17:15:47 -08:00
ishaan-jaff
957da9fbeb
(test) our AWS account is Suspended
2024-02-28 18:32:27 -08:00
Krrish Dholakia
4c951d20bc
test: removing aws tests - account suspended - pending their approval
2024-02-28 13:46:20 -08:00
Krrish Dholakia
a042092faa
test: removing bedrock claude-v1 testing - bedrock removed this
2024-02-28 11:08:17 -08:00
ishaan-jaff
8a615cd125
(test) async s3 cache
2024-02-08 10:04:10 -08:00
ishaan-jaff
79c225a60f
(ci/cd) run again
2024-02-06 13:26:48 -08:00
ishaan-jaff
8175fb4deb
(fix) mark semantic caching as beta test
2024-02-06 11:04:19 -08:00
ishaan-jaff
1afdf5cf36
(fix) semantic caching
2024-02-06 10:55:15 -08:00
ishaan-jaff
c8a83bb745
(fix) test-semantic caching
2024-02-06 10:39:44 -08:00
ishaan-jaff
a125ffe190
(test) async semantic cache
2024-02-06 08:14:54 -08:00
ishaan-jaff
81f8ac00b2
(test) semantic caching
2024-02-05 18:22:50 -08:00
ishaan-jaff
cf4bd1cf4e
(test) semantic cache
2024-02-05 17:58:32 -08:00