litellm/litellm
Ishaan Jaff 4ebe6813c0
(refactor caching) use common _retrieve_from_cache helper (#6212)
* use folder for caching

* fix importing caching

* fix clickhouse pyright

* fix linting

* fix correctly pass kwargs and args

* fix test case for embedding

* fix linting

* fix embedding caching logic

* fix refactor handle utils.py

* refactor async set stream cache

* fix linting

* refactor - use _retrieve_from_cache

* refactor use _convert_cached_result_to_model_response

* fix linting errors
2024-10-14 19:12:41 +05:30
adapters LiteLLM Minor Fixes & Improvements (09/27/2024) (#5938) 2024-09-27 22:52:57 -07:00
assistants Add pyright to ci/cd + Fix remaining type-checking errors (#6082) 2024-10-05 17:04:00 -04:00
batch_completion (refactor) caching use LLMCachingHandler for async_get_cache and set_cache (#6208) 2024-10-14 16:34:01 +05:30
batches Litellm ruff linting enforcement (#5992) 2024-10-01 19:44:20 -04:00
caching (refactor caching) use common _retrieve_from_cache helper (#6212) 2024-10-14 19:12:41 +05:30
deprecated_litellm_server (refactor) caching use LLMCachingHandler for async_get_cache and set_cache (#6208) 2024-10-14 16:34:01 +05:30
files Litellm ruff linting enforcement (#5992) 2024-10-01 19:44:20 -04:00
fine_tuning Litellm ruff linting enforcement (#5992) 2024-10-01 19:44:20 -04:00
integrations (feat) prometheus have well defined latency buckets (#6211) 2024-10-14 17:16:01 +05:30
litellm_core_utils (refactor caching) use LLMCachingHandler for caching streaming responses (#6210) 2024-10-14 17:46:45 +05:30
llms (refactor caching) use common _retrieve_from_cache helper (#6212) 2024-10-14 19:12:41 +05:30
proxy (feat) prometheus have well defined latency buckets (#6211) 2024-10-14 17:16:01 +05:30
realtime_api LiteLLM Minor Fixes & Improvements (10/10/2024) (#6158) 2024-10-11 23:04:36 -07:00
rerank_api Add pyright to ci/cd + Fix remaining type-checking errors (#6082) 2024-10-05 17:04:00 -04:00
router_strategy (refactor) caching use LLMCachingHandler for async_get_cache and set_cache (#6208) 2024-10-14 16:34:01 +05:30
router_utils (refactor) caching use LLMCachingHandler for async_get_cache and set_cache (#6208) 2024-10-14 16:34:01 +05:30
secret_managers (refactor) caching use LLMCachingHandler for async_get_cache and set_cache (#6208) 2024-10-14 16:34:01 +05:30
tests Feat: Add Langtrace integration (#5341) 2024-10-11 19:19:53 +05:30
types (refactor caching) use common _retrieve_from_cache helper (#6212) 2024-10-14 19:12:41 +05:30
__init__.py (refactor) caching use LLMCachingHandler for async_get_cache and set_cache (#6208) 2024-10-14 16:34:01 +05:30
_logging.py Litellm ruff linting enforcement (#5992) 2024-10-01 19:44:20 -04:00
_redis.py Add pyright to ci/cd + Fix remaining type-checking errors (#6082) 2024-10-05 17:04:00 -04:00
_service_logger.py [Fix] OTEL - Don't log messages when callback settings disable message logging (#5875) 2024-09-24 18:29:52 -07:00
_version.py Litellm ruff linting enforcement (#5992) 2024-10-01 19:44:20 -04:00
budget_manager.py Litellm ruff linting enforcement (#5992) 2024-10-01 19:44:20 -04:00
cost.json store llm costs in budget manager 2023-09-09 19:11:35 -07:00
cost_calculator.py LiteLLM Minor Fixes & Improvements (10/09/2024) (#6139) 2024-10-10 00:42:11 -07:00
exceptions.py LiteLLM Minor Fixes & Improvements (09/21/2024) (#5819) 2024-09-21 18:51:53 -07:00
main.py (refactor) caching use LLMCachingHandler for async_get_cache and set_cache (#6208) 2024-10-14 16:34:01 +05:30
model_prices_and_context_window_backup.json Litellm Minor Fixes & Improvements (10/12/2024) (#6179) 2024-10-12 11:48:34 -07:00
py.typed feature - Types for mypy - #360 2024-05-30 14:14:41 -04:00
requirements.txt Add symlink and only copy in source dir to stay under 50MB compressed limit for Lambdas. 2023-11-22 23:07:33 -05:00
router.py (refactor) caching use LLMCachingHandler for async_get_cache and set_cache (#6208) 2024-10-14 16:34:01 +05:30
scheduler.py (refactor) caching use LLMCachingHandler for async_get_cache and set_cache (#6208) 2024-10-14 16:34:01 +05:30
timeout.py Litellm ruff linting enforcement (#5992) 2024-10-01 19:44:20 -04:00
utils.py (refactor caching) use LLMCachingHandler for caching streaming responses (#6210) 2024-10-14 17:46:45 +05:30