Commit graph

8 commits

Author SHA1 Message Date
Krrish Dholakia
6a8d518e44 test(test_lowest_latency_routing.py): use the correct cache key 2024-01-10 22:15:01 +05:30
Krrish Dholakia
9a829ff956 refactor: cleanup duplicates 2024-01-10 21:42:20 +05:30
Krish Dholakia
298e937586
Merge branch 'main' into litellm_latency_routing_updates 2024-01-10 21:33:54 +05:30
Krrish Dholakia
fe632c08a4 fix(router.py): allow user to control the latency routing time window 2024-01-10 20:56:52 +05:30
Krrish Dholakia
bb04a340a5 fix(lowest_latency.py): add back tpm/rpm checks, configurable time window 2024-01-10 20:52:01 +05:30
Krrish Dholakia
a5147f9e06 feat(lowest_latency.py): support expanded time window for latency based routing
uses a 1hr avg. of latency for deployments, to determine which to route to

https://github.com/BerriAI/litellm/issues/1361
2024-01-09 09:38:04 +05:30
Krrish Dholakia
027218c3f0 test(test_lowest_latency_routing.py): add more tests 2023-12-30 17:41:42 +05:30
Krrish Dholakia
f2d0d5584a fix(router.py): fix latency based routing 2023-12-30 17:25:40 +05:30