litellm/tests/load_tests
Ishaan Jaff c8d15544c8
[Fix] Router cooldown logic - use % thresholds instead of allowed fails to cooldown deployments (#5698)
* move cooldown logic to it's own helper

* add new track deployment metrics folder

* increment success, fails for deployment in current minute

* fix cooldown logic

* fix test_aaarouter_dynamic_cooldown_message_retry_time

* fix test_single_deployment_no_cooldowns_test_prod_mock_completion_calls

* clean up get from deployment test

* fix _async_get_healthy_deployments

* add mock InternalServerError

* test deployment failing 25% requests

* add test_high_traffic_cooldowns_one_bad_deployment

* fix vertex load test

* add test for rate limit error models in cool down

* change default cooldown time

* fix cooldown message time

* fix cooldown on 429 error

* fix doc string for _should_cooldown_deployment

* fix sync cooldown logic router
2024-09-14 18:01:19 -07:00
..
test_datadog_load_test.py [Feat - Perf Improvement] DataDog Logger 91% lower latency (#5687) 2024-09-13 17:39:17 -07:00
test_langsmith_load_test.py fix langsmith load tests 2024-09-11 21:19:03 -07:00
test_otel_load_test.py fix otel load test 2024-09-11 21:27:31 -07:00
test_vertex_load_tests.py [Fix] Router cooldown logic - use % thresholds instead of allowed fails to cooldown deployments (#5698) 2024-09-14 18:01:19 -07:00
vertex_key.json [Fix] Performance - use in memory cache when downloading images from a url (#5657) 2024-09-13 07:23:42 -07:00