litellm-mirror/litellm/router_strategy
Krrish Dholakia 90cdfef1c1 fix(lowest_latency.py): allow setting a buffer for getting values within a certain latency threshold
if an endpoint is slow - it's completion time might not be updated till the call is completed. This prevents us from overloading those endpoints, in a simple way.
2024-04-30 12:00:26 -07:00
..
least_busy.py fix(router.py): handle id being passed in as int 2024-04-04 14:23:10 -07:00
lowest_latency.py fix(lowest_latency.py): allow setting a buffer for getting values within a certain latency threshold 2024-04-30 12:00:26 -07:00
lowest_tpm_rpm.py usage based routing RPM count fix 2024-04-30 00:29:38 +05:30
lowest_tpm_rpm_v2.py fix(lowest_tpm_rpm_v2.py): skip if item_tpm is None 2024-04-29 21:34:25 -07:00