litellm-mirror

mirror of https://github.com/BerriAI/litellm.git synced 2025-04-25 18:54:30 +00:00

History

Krish Dholakia e712a2090b redis otel tracing + async support for latency routing (#6452 ) * docs(exception_mapping.md): add missing exception types Fixes https://github.com/Aider-AI/aider/issues/2120#issuecomment-2438971183 * fix(main.py): register custom model pricing with specific key Ensure custom model pricing is registered to the specific model+provider key combination * test: make testing more robust for custom pricing * fix(redis_cache.py): instrument otel logging for sync redis calls ensures complete coverage for all redis cache calls * refactor: pass parent_otel_span for redis caching calls in router allows for more observability into what calls are causing latency issues * test: update tests with new params * refactor: ensure e2e otel tracing for router * refactor(router.py): add more otel tracing acrosss router catch all latency issues for router requests * fix: fix linting error * fix(router.py): fix linting error * fix: fix test * test: fix tests * fix(dual_cache.py): pass ttl to redis cache * fix: fix param		2024-10-28 21:52:12 -07:00
..
__init__.py	fix(proxy_server.py): enable pre+post-call hooks and max parallel request limits	2023-12-08 17:11:30 -08:00
azure_content_safety.py	(refactor) caching use LLMCachingHandler for async_get_cache and set_cache (#6208 )	2024-10-14 16:34:01 +05:30
batch_redis_get.py	(refactor) caching use LLMCachingHandler for async_get_cache and set_cache (#6208 )	2024-10-14 16:34:01 +05:30
cache_control_check.py	(refactor) caching use LLMCachingHandler for async_get_cache and set_cache (#6208 )	2024-10-14 16:34:01 +05:30
dynamic_rate_limiter.py	(refactor) caching use LLMCachingHandler for async_get_cache and set_cache (#6208 )	2024-10-14 16:34:01 +05:30
example_presidio_ad_hoc_recognizer.json	fix(presidio_pii_masking.py): enable user to pass ad hoc recognizer for pii masking	2024-02-20 16:01:15 -08:00
max_budget_limiter.py	redis otel tracing + async support for latency routing (#6452 )	2024-10-28 21:52:12 -07:00
parallel_request_limiter.py	(code quality) add ruff check PLR0915 for `too-many-statements` (#6309 )	2024-10-18 15:36:49 +05:30
presidio_pii_masking.py	(refactor) caching use LLMCachingHandler for async_get_cache and set_cache (#6208 )	2024-10-14 16:34:01 +05:30
prompt_injection_detection.py	(refactor) caching use LLMCachingHandler for async_get_cache and set_cache (#6208 )	2024-10-14 16:34:01 +05:30