litellm-mirror

mirror of https://github.com/BerriAI/litellm.git synced 2025-04-27 11:43:54 +00:00

History

Krrish Dholakia 90cdfef1c1 fix(lowest_latency.py): allow setting a buffer for getting values within a certain latency threshold if an endpoint is slow - it's completion time might not be updated till the call is completed. This prevents us from overloading those endpoints, in a simple way.		2024-04-30 12:00:26 -07:00
..
least_busy.py	fix(router.py): handle id being passed in as int	2024-04-04 14:23:10 -07:00
lowest_latency.py	fix(lowest_latency.py): allow setting a buffer for getting values within a certain latency threshold	2024-04-30 12:00:26 -07:00
lowest_tpm_rpm.py	usage based routing RPM count fix	2024-04-30 00:29:38 +05:30
lowest_tpm_rpm_v2.py	fix(lowest_tpm_rpm_v2.py): skip if item_tpm is None	2024-04-29 21:34:25 -07:00