32 commits

Author | SHA1 | Message | Date
Krrish Dholakia | cdec7a414f | test(test_router_fallbacks.py): fix test | 2024-05-10 09:58:40 -07:00
Krrish Dholakia | 69bcde6f6b | test(test_router_fallbacks.py): fix test - making it easier to debug | 2024-05-06 18:49:36 -07:00
Krrish Dholakia | a35ac050d2 | test(test_router_fallbacks.py): bump test limits | 2024-05-03 20:42:29 -07:00
Krrish Dholakia | cfc1eeb3c3 | test(test_router_fallbacks.py): rename test to run earlier | 2024-04-30 22:04:20 -07:00
Krrish Dholakia | e506e71cb9 | fix(test_router_fallbacks.py): reduce test rpm | 2024-04-30 22:00:48 -07:00
Krrish Dholakia | bc5c9d7da9 | fix(test_router_fallbacks.py): fix tests | 2024-04-30 18:48:39 -07:00
Krrish Dholakia | 1baad80c7d | fix(router.py): cooldown deployments, for 401 errors | 2024-04-30 17:54:00 -07:00
Krrish Dholakia | b9c0b55e7c | test: fix test - set num_retries=0 | 2024-04-27 21:02:19 -07:00
Krrish Dholakia | 54241f2551 | test(test_router_fallbacks.py): fix testing | 2024-04-25 17:43:40 -07:00
Krrish Dholakia | 9735250db7 | fix(router.py): support fallbacks / retries with sync embedding calls | 2024-03-11 14:51:22 -07:00
ishaan-jaff | 5807718a69 | (test) usage based routing with fallbacks | 2024-01-22 14:21:30 -08:00
ishaan-jaff | 0fa9d8af82 | (test) custom cooldown times - router | 2024-01-19 19:41:52 -08:00
ishaan-jaff | 0196ac6376 | (test) router-fallbacks | 2024-01-19 14:12:09 -08:00
ishaan-jaff | 57a20100e4 | (test) usage based routing | 2024-01-19 13:58:08 -08:00
ishaan-jaff | 912e2de364 | (test) usage based router + fallbacks | 2024-01-19 12:34:08 -08:00
Krrish Dholakia | a37a18ca80 | feat(router.py): add support for retry/fallbacks for async embedding calls | 2024-01-02 11:54:28 +05:30
Krrish Dholakia | e516cfe9f5 | fix(utils.py): allow text completion input to be either model or engine | 2023-12-27 17:24:16 +05:30
Krrish Dholakia | 10a74d02c1 | test(test_router_fallbacks.py): fix test to check previous models in pre_api_call not on success | 2023-12-27 16:34:40 +05:30
Krrish Dholakia | 6d63c0015b | test(test_router_fallbacks.py): add testing for sync streaming fallbacks | 2023-12-27 15:10:43 +05:30
Krrish Dholakia | c9fdbaf898 | fix(azure.py,-openai.py): correctly raise errors if streaming calls fail | 2023-12-27 15:08:37 +05:30
Krrish Dholakia | 4905929de3 | refactor: add black formatting | 2023-12-25 14:11:20 +05:30
Krrish Dholakia | 9cf5ab468f | fix(router.py): deepcopy initial model list, don't mutate it | 2023-12-12 09:54:06 -08:00
ishaan-jaff | f5d64a4992 | (fix) test router | 2023-12-12 09:50:44 -08:00
Krrish Dholakia | b3600e9e8a | test(test_router_fallbacks.py): update to use async logging | 2023-12-11 20:12:29 -08:00
Krrish Dholakia | 7b83238cb5 | fix(router.py): log when a call is retried or fallback happens | 2023-12-05 21:29:58 -08:00
Krrish Dholakia | fa713abfc3 | fix(router.py): check for fallbacks in completion params for router | 2023-11-25 18:46:45 -08:00
Krrish Dholakia | 2e8d582a34 | fix(proxy_server.py): fix linting issues | 2023-11-24 11:39:01 -08:00
Krrish Dholakia | 4a5dae3941 | fix(main.py): fix streaming_chunk_builder to return usage | 2023-11-24 11:27:04 -08:00
Krrish Dholakia | 9a44433844 | feat(proxy_server.py): /key/delete endpoint | 2023-11-23 21:37:53 -08:00
Krrish Dholakia | c273d6f0d6 | fix(router.py): add support for context window fallbacks on router | 2023-11-23 16:43:02 -08:00
Krrish Dholakia | afac42e93a | fix(router.py): enable async completions with model fallbacks | 2023-11-23 16:15:57 -08:00
Krrish Dholakia | 8ac03e492f | fix(router.py): enable fallbacks for sync completions | 2023-11-23 16:06:46 -08:00