Krrish Dholakia
|
6cca5612d2
|
refactor: replace 'traceback.print_exc()' with logging library
allows error logs to be in json format for otel logging
|
2024-06-06 13:47:43 -07:00 |
|
sumanth
|
71e0294485
|
addressed comments
|
2024-05-14 10:05:19 +05:30 |
|
SUMANTH
|
978672a56d
|
Merge branch 'BerriAI:main' into usage-based-routing-ttl-on-cache
|
2024-05-14 09:08:01 +05:30 |
|
Krrish Dholakia
|
4a3b084961
|
feat(bedrock_httpx.py): moves to using httpx client for bedrock cohere calls
|
2024-05-11 13:43:08 -07:00 |
|
sumanth
|
3bc6b5d119
|
usage-based-routing-ttl-on-cache
|
2024-05-03 10:50:45 +05:30 |
|
sumanth
|
89e655c79e
|
usage based routing RPM count fix
|
2024-04-30 00:29:38 +05:30 |
|
Krrish Dholakia
|
2236f283fe
|
fix(router.py): handle id being passed in as int
|
2024-04-04 14:23:10 -07:00 |
|
Krrish Dholakia
|
47ca223d0b
|
fix(lowest_tpm_rpm_routing.py): fix base case where max tpm/rpm is 0
|
2024-03-28 14:51:31 -07:00 |
|
Ishaan Jaff
|
5d121a9f3c
|
(fix) stop using f strings with logger
|
2024-03-25 10:47:18 -07:00 |
|
Krrish Dholakia
|
2f1899284c
|
fix(router.py): add more debug logs
|
2024-03-11 12:34:35 -07:00 |
|
ishaan-jaff
|
e23c68b15a
|
(fix) failing usage based routing test
|
2024-03-11 12:14:13 -07:00 |
|
Krrish Dholakia
|
0273410310
|
fix(lowest_tpm_rpm.py): handle async scenarios
|
2024-03-06 21:38:30 -08:00 |
|
ishaan-jaff
|
d0442ae0f2
|
(feat) router - usage based routing - consider input_tokens
|
2024-01-19 13:59:49 -08:00 |
|
Krrish Dholakia
|
2ab31bcaf8
|
fix(lowest_tpm_rpm.py): handle null case for text/message input
|
2024-01-02 12:24:29 +05:30 |
|
Krrish Dholakia
|
a37a18ca80
|
feat(router.py): add support for retry/fallbacks for async embedding calls
|
2024-01-02 11:54:28 +05:30 |
|
Krrish Dholakia
|
dff4c172d0
|
refactor(test_router_caching.py): move tpm/rpm routing tests to separate file
|
2024-01-02 11:10:11 +05:30 |
|
Krrish Dholakia
|
a83e2e07cf
|
fix(router.py): correctly raise no model available error
https://github.com/BerriAI/litellm/issues/1289
|
2024-01-01 21:22:42 +05:30 |
|
Krrish Dholakia
|
b66cf0aa43
|
fix(lowest_tpm_rpm_routing.py): broaden scope of get deployment logic
|
2023-12-30 13:27:50 +05:30 |
|
Krrish Dholakia
|
a34de56289
|
fix(router.py): handle initial scenario for tpm/rpm routing
|
2023-12-30 07:28:45 +05:30 |
|
Krrish Dholakia
|
2fc264ca04
|
fix(router.py): fix int logic
|
2023-12-29 20:41:56 +05:30 |
|
Krrish Dholakia
|
cf91e49c87
|
refactor(lowest_tpm_rpm.py): move tpm/rpm based routing to a separate file for better testing
|
2023-12-29 18:33:43 +05:30 |
|