litellm-mirror

mirror of https://github.com/BerriAI/litellm.git synced 2025-04-26 19:24:27 +00:00

Krish Dholakia 80f7af510b Improve Proxy Resiliency: Cooldown single-deployment model groups if 100% calls failed in high traffic (#7823)	2025-01-17 20:17:02 -08:00
* refactor(_is_cooldown_required): move '_is_cooldown_required' into cooldown_handlers.py
* refactor(cooldown_handlers.py): move cooldown constants into `.constants.py`
* fix(cooldown_handlers.py): remove if single deployment don't cooldown logic move to traffic based cooldown logic Addresses https://github.com/BerriAI/litellm/issues/7822
* fix: add unit tests for '_should_cooldown_deployment'
* test: ensure all tests pass
* test: update test
* fix(cooldown_handlers.py): don't cooldown single deployment models for anything besides traffic related errors
* fix(cooldown_handlers.py): fix cooldown handler logic
* fix(cooldown_handlers.py): fix check
pre_call_checks	Litellm dev readd prompt caching (#7299)	2024-12-18 15:13:49 -08:00
router_callbacks	(code quality) run ruff rule to ban unused imports (#7313)	2024-12-19 12:33:42 -08:00
batch_utils.py	(code quality) run ruff rule to ban unused imports (#7313)	2024-12-19 12:33:42 -08:00
client_initalization_utils.py	(Feat) - LiteLLM Use `UsernamePasswordCredential` for Azure OpenAI (#7496)	2025-01-01 14:11:27 -08:00
cooldown_cache.py	(code quality) run ruff rule to ban unused imports (#7313)	2024-12-19 12:33:42 -08:00
cooldown_callbacks.py	Litellm dev readd prompt caching (#7299)	2024-12-18 15:13:49 -08:00
cooldown_handlers.py	Improve Proxy Resiliency: Cooldown single-deployment model groups if 100% calls failed in high traffic (#7823)	2025-01-17 20:17:02 -08:00
fallback_event_handlers.py	Controll fallback prompts client-side (#7334)	2024-12-20 19:09:53 -08:00
get_retry_from_policy.py	Litellm dev 12 06 2024 (#7067)	2024-12-06 22:44:18 -08:00
handle_error.py	(code quality) run ruff rule to ban unused imports (#7313)	2024-12-19 12:33:42 -08:00
pattern_match_deployments.py	Litellm dev 12 28 2024 p2 (#7458)	2024-12-28 19:38:06 -08:00
prompt_caching_cache.py	(code quality) run ruff rule to ban unused imports (#7313)	2024-12-19 12:33:42 -08:00
response_headers.py	LiteLLM Minor Fixes & Improvements (11/26/2024) (#6913)	2024-11-28 00:01:38 +05:30