litellm

History

Ishaan Jaff c8d15544c8 [Fix] Router cooldown logic - use % thresholds instead of allowed fails to cooldown deployments (#5698 ) * move cooldown logic to it's own helper * add new track deployment metrics folder * increment success, fails for deployment in current minute * fix cooldown logic * fix test_aaarouter_dynamic_cooldown_message_retry_time * fix test_single_deployment_no_cooldowns_test_prod_mock_completion_calls * clean up get from deployment test * fix _async_get_healthy_deployments * add mock InternalServerError * test deployment failing 25% requests * add test_high_traffic_cooldowns_one_bad_deployment * fix vertex load test * add test for rate limit error models in cool down * change default cooldown time * fix cooldown message time * fix cooldown on 429 error * fix doc string for _should_cooldown_deployment * fix sync cooldown logic router		2024-09-14 18:01:19 -07:00
..
test_datadog_load_test.py	[Feat - Perf Improvement] DataDog Logger 91% lower latency (#5687 )	2024-09-13 17:39:17 -07:00
test_langsmith_load_test.py	fix langsmith load tests	2024-09-11 21:19:03 -07:00
test_otel_load_test.py	fix otel load test	2024-09-11 21:27:31 -07:00
test_vertex_load_tests.py	[Fix] Router cooldown logic - use % thresholds instead of allowed fails to cooldown deployments (#5698 )	2024-09-14 18:01:19 -07:00
vertex_key.json	[Fix] Performance - use in memory cache when downloading images from a url (#5657 )	2024-09-13 07:23:42 -07:00