Krrish Dholakia | 48a5948081 | fix(router.py): handle id being passed in as int | 2024-04-04 14:23:10 -07:00
Krrish Dholakia | 568f4ae461 | test(test_lowest_latency_routing.py): use the correct cache key | 2024-01-10 22:15:01 +05:30
Krrish Dholakia | 744e8bccc7 | refactor: cleanup duplicates | 2024-01-10 21:42:20 +05:30
Krish Dholakia | e635ca2151 | Merge branch 'main' into litellm_latency_routing_updates | 2024-01-10 21:33:54 +05:30
Krrish Dholakia | 7df19b2f7c | fix(router.py): allow user to control the latency routing time window | 2024-01-10 20:56:52 +05:30
Krrish Dholakia | f288b12411 | fix(lowest_latency.py): add back tpm/rpm checks, configurable time window | 2024-01-10 20:52:01 +05:30
Krrish Dholakia | fb9ebfbedd | feat(lowest_latency.py): support expanded time window for latency based routing; uses a 1hr avg. of latency for deployments, to determine which to route to; https://github.com/BerriAI/litellm/issues/1361 | 2024-01-09 09:38:04 +05:30
Krrish Dholakia | d3dee9b20c | test(test_lowest_latency_routing.py): add more tests | 2023-12-30 17:41:42 +05:30
Krrish Dholakia | 25ee96271e | fix(router.py): fix latency based routing | 2023-12-30 17:25:40 +05:30