Ishaan Jaff
|
b196f41d64
|
fix qdrant litellm on proxy
|
2024-08-21 12:52:29 -07:00 |
|
Ishaan Jaff
|
8c83fb3f34
|
fixes for using qdrant with litellm proxy
|
2024-08-21 12:36:41 -07:00 |
|
Ishaan Jaff
|
0f3274b074
|
fix drant url
|
2024-08-21 12:09:09 -07:00 |
|
Ishaan Jaff
|
a34aeafdb5
|
Merge pull request #5018 from haadirakhangi/main
Qdrant Semantic Caching
|
2024-08-21 08:50:43 -07:00 |
|
Haadi Rakhangi
|
9df92923d8
|
implemented RestAPI and added support for cloud and local Qdrant clusters
|
2024-08-19 20:46:30 +05:30 |
|
Krrish Dholakia
|
2874b94fb1
|
refactor: replace .error() with .exception() logging for better debugging on sentry
|
2024-08-16 09:22:47 -07:00 |
|
prd-tuong-nguyen
|
70f2e84bc4
|
feat: hash prompt when caching
|
2024-08-08 16:19:14 +07:00 |
|
Ishaan Jaff
|
5b7d1b0ae4
|
caching use file_checksum
|
2024-08-06 13:03:14 -07:00 |
|
Krrish Dholakia
|
d526a12080
|
fix(init.py): rename feature_flag
|
2024-08-05 11:23:20 -07:00 |
|
Krrish Dholakia
|
8500f6d087
|
feat(caching.py): enable caching on provider-specific optional params
Closes https://github.com/BerriAI/litellm/issues/5049
|
2024-08-05 11:18:59 -07:00 |
|
Ishaan Jaff
|
d122508385
|
use file name when getting cache key
|
2024-08-02 14:52:08 -07:00 |
|
Haadi Rakhangi
|
a047df3825
|
qdrant semantic caching added
|
2024-08-02 21:07:19 +05:30 |
|
Krrish Dholakia
|
a75b70fbd6
|
fix(caching.py): support /completion caching by default
updates supported call types in redis cache to cover text_completion caching
|
2024-07-29 08:19:30 -07:00 |
|
Ishaan Jaff
|
aade38760d
|
use common helpers for writing to otel
|
2024-07-27 11:40:39 -07:00 |
|
Ishaan Jaff
|
40f9e67be4
|
move _get_parent_otel_span_from_kwargs to otel.py
|
2024-07-27 11:12:13 -07:00 |
|
Ishaan Jaff
|
7c489856e3
|
add doc string to explain what delete cache does
|
2024-07-13 12:25:31 -07:00 |
|
Ishaan Jaff
|
9d657c42d8
|
de-ref unused cache items
|
2024-07-12 16:38:36 -07:00 |
|
Krrish Dholakia
|
a79cb33960
|
fix(caching.py): fix async redis health check
|
2024-07-06 09:14:29 -07:00 |
|
Ishaan Jaff
|
a1968eaf3f
|
remove debug print statement
|
2024-06-27 20:58:29 -07:00 |
|
Ishaan Jaff
|
5977b5be20
|
ci/cd add debugging for cache eviction
|
2024-06-25 08:14:09 -07:00 |
|
Ishaan Jaff
|
f800425744
|
fix default ttl for InMemoryCache
|
2024-06-24 21:21:38 -07:00 |
|
Ishaan Jaff
|
3ebf1ec7eb
|
feat use custom eviction policy
|
2024-06-24 20:28:03 -07:00 |
|
Ishaan Jaff
|
4e8f2a57e0
|
fix install on python 3.8
|
2024-06-24 17:27:14 -07:00 |
|
Ishaan Jaff
|
5b19aac705
|
cleanup InMemoryCache
|
2024-06-24 17:24:59 -07:00 |
|
Ishaan Jaff
|
4f03556af6
|
use lru cache
|
2024-06-24 17:15:53 -07:00 |
|
Ishaan Jaff
|
81ef2c38dc
|
fix InMemoryCache
|
2024-06-24 17:08:30 -07:00 |
|
Ishaan Jaff
|
21fd91fe94
|
fix use caching lib
|
2024-06-24 17:03:23 -07:00 |
|
Ishaan Jaff
|
0c4c6bfa5e
|
fix in mem cache tests
|
2024-06-22 19:52:18 -07:00 |
|
Ishaan Jaff
|
8e3a073323
|
Merge branch 'main' into litellm_fix_in_mem_usage
|
2024-06-22 19:23:37 -07:00 |
|
Ishaan Jaff
|
5b2d4da43f
|
fix caching clear in memory cache mem util
|
2024-06-22 19:21:37 -07:00 |
|
Ishaan Jaff
|
c4ae06576b
|
fix - clean up in memory cache
|
2024-06-22 18:46:30 -07:00 |
|
Krrish Dholakia
|
0430807178
|
feat(dynamic_rate_limiter.py): update cache with active project
|
2024-06-21 20:25:40 -07:00 |
|
David Manouchehri
|
47e3880638
|
fix(caching.py): Stop throwing constant spam errors on every single S3 cache miss. Fixes #4146.
|
2024-06-13 20:58:18 +00:00 |
|
Ishaan Jaff
|
786e6b4ae3
|
feat - final working redis cache otel
|
2024-06-07 16:36:04 -07:00 |
|
Ishaan Jaff
|
72a6d49b21
|
feat - working exception logs for Redis errors
|
2024-06-07 16:30:29 -07:00 |
|
Ishaan Jaff
|
ef99e3e1d0
|
fix - basic success logging for redis cache
|
2024-06-07 16:20:23 -07:00 |
|
Krrish Dholakia
|
e391e30285
|
refactor: replace 'traceback.print_exc()' with logging library
allows error logs to be in json format for otel logging
|
2024-06-06 13:47:43 -07:00 |
|
Krrish Dholakia
|
2db3819142
|
fix(proxy_server.py): allow user_api_key_cache_ttl to be a controllable param
|
2024-05-25 12:07:28 -07:00 |
|
Ishaan Jaff
|
a3fb6e8c34
|
Merge pull request #3266 from antonioloison/litellm_add_disk_cache
[Feature] Add cache to disk
|
2024-05-14 09:24:01 -07:00 |
|
Rahul Kataria
|
927b6fa967
|
[Optimize] Optimize code in caching file
|
2024-05-12 17:04:18 +05:30 |
|
Rahul Kataria
|
6c16f2924b
|
[Reset Code]
|
2024-05-12 16:37:50 +05:30 |
|
Rahul Kataria
|
9b77b8c90b
|
[Optimize] Optimize the code in caching file
|
2024-05-12 15:04:45 +05:30 |
|
Antonio Loison
|
dabf4cf8a8
|
build(caching.py): move diskcache import inside class and add cache_dir argument to Cache
|
2024-05-10 12:04:54 +02:00 |
|
Antonio Loison
|
79d774e330
|
chore: add diskcache as extra dependency
|
2024-05-10 11:19:14 +02:00 |
|
Antonio Loison
|
04b5dbef1a
|
build(caching.py): add disk option for cache
|
2024-05-10 10:03:38 +02:00 |
|
Antonio Loison
|
34f4f719aa
|
build(caching.py) - add disk cache object
|
2024-05-10 10:03:35 +02:00 |
|
Krrish Dholakia
|
0c1fc34e9f
|
feat(slack_alerting.py): support sending daily reports on deployments
allow admin to easily know slow + failing deployments
Closes https://github.com/BerriAI/litellm/issues/3483
|
2024-05-06 17:18:42 -07:00 |
|
Krrish Dholakia
|
da4991de05
|
fix(caching.py): fix redis caching ping check
don't fail to startup. Log an error message.
|
2024-05-04 08:48:53 -07:00 |
|
Krish Dholakia
|
a9dc93e860
|
Merge branch 'main' into litellm_ssl_caching_fix
|
2024-04-19 17:20:27 -07:00 |
|
Krrish Dholakia
|
b3a8c2885b
|
test(test_prometheus_services.py): fix testing to handle caching ping in init
|
2024-04-19 16:15:29 -07:00 |
|