Commit graph

21 commits

Author SHA1 Message Date
Krrish Dholakia
6cca5612d2 refactor: replace 'traceback.print_exc()' with logging library
allows error logs to be in json format for otel logging
2024-06-06 13:47:43 -07:00
sumanth
71e0294485 addressed comments 2024-05-14 10:05:19 +05:30
SUMANTH
978672a56d
Merge branch 'BerriAI:main' into usage-based-routing-ttl-on-cache 2024-05-14 09:08:01 +05:30
Krrish Dholakia
4a3b084961 feat(bedrock_httpx.py): moves to using httpx client for bedrock cohere calls 2024-05-11 13:43:08 -07:00
sumanth
3bc6b5d119 usage-based-routing-ttl-on-cache 2024-05-03 10:50:45 +05:30
sumanth
89e655c79e usage based routing RPM count fix 2024-04-30 00:29:38 +05:30
Krrish Dholakia
2236f283fe fix(router.py): handle id being passed in as int 2024-04-04 14:23:10 -07:00
Krrish Dholakia
47ca223d0b fix(lowest_tpm_rpm_routing.py): fix base case where max tpm/rpm is 0 2024-03-28 14:51:31 -07:00
Ishaan Jaff
5d121a9f3c (fix) stop using f strings with logger 2024-03-25 10:47:18 -07:00
Krrish Dholakia
2f1899284c fix(router.py): add more debug logs 2024-03-11 12:34:35 -07:00
ishaan-jaff
e23c68b15a (fix) failing usage based routing test 2024-03-11 12:14:13 -07:00
Krrish Dholakia
0273410310 fix(lowest_tpm_rpm.py): handle async scenarios 2024-03-06 21:38:30 -08:00
ishaan-jaff
d0442ae0f2 (feat) router - usage based routing - consider input_tokens 2024-01-19 13:59:49 -08:00
Krrish Dholakia
2ab31bcaf8 fix(lowest_tpm_rpm.py): handle null case for text/message input 2024-01-02 12:24:29 +05:30
Krrish Dholakia
a37a18ca80 feat(router.py): add support for retry/fallbacks for async embedding calls 2024-01-02 11:54:28 +05:30
Krrish Dholakia
dff4c172d0 refactor(test_router_caching.py): move tpm/rpm routing tests to separate file 2024-01-02 11:10:11 +05:30
Krrish Dholakia
a83e2e07cf fix(router.py): correctly raise no model available error
https://github.com/BerriAI/litellm/issues/1289
2024-01-01 21:22:42 +05:30
Krrish Dholakia
b66cf0aa43 fix(lowest_tpm_rpm_routing.py): broaden scope of get deployment logic 2023-12-30 13:27:50 +05:30
Krrish Dholakia
a34de56289 fix(router.py): handle initial scenario for tpm/rpm routing 2023-12-30 07:28:45 +05:30
Krrish Dholakia
2fc264ca04 fix(router.py): fix int logic 2023-12-29 20:41:56 +05:30
Krrish Dholakia
cf91e49c87 refactor(lowest_tpm_rpm.py): move tpm/rpm based routing to a separate file for better testing 2023-12-29 18:33:43 +05:30