Commit graph

1562 commits

| Author | SHA1 | Message | Date |
| --- | --- | --- | --- |
| Krrish Dholakia | 2a033fd8a2 | test(test_router_cooldowns.py): adding logging | 2023-11-24 12:30:08 -08:00 |
| Krrish Dholakia | 3dcbf6197a | test(test_router.py): reintegrate router testing | 2023-11-24 11:48:24 -08:00 |
| Krrish Dholakia | c75e90663c | test(test_stream_chunk_builder.py): fix setting api key | 2023-11-24 11:47:48 -08:00 |
| Krrish Dholakia | 2e8d582a34 | fix(proxy_server.py): fix linting issues | 2023-11-24 11:39:01 -08:00 |
| Krrish Dholakia | 4a5dae3941 | fix(main.py): fix streaming_chunk_builder to return usage | 2023-11-24 11:27:04 -08:00 |
| ishaan-jaff | 19fb24cd15 | (feat) cost tracking for azure llms | 2023-11-23 21:41:38 -08:00 |
| Krrish Dholakia | 9a44433844 | feat(proxy_server.py): /key/delete endpoint | 2023-11-23 21:37:53 -08:00 |
| ishaan-jaff | 0c210cc96c | (test) caching | 2023-11-23 21:23:41 -08:00 |
| ishaan-jaff | c929c274f6 | (test) proxy: test_embedding | 2023-11-23 21:16:51 -08:00 |
| ishaan-jaff | ba9dbbe6d5 | (test) router: acompletion + caching | 2023-11-23 21:05:48 -08:00 |
| ishaan-jaff | 39072bd196 | (test) proxy - azure/chat/completion test | 2023-11-23 20:56:41 -08:00 |
| Krrish Dholakia | 187403c5cc | fix(router.py): add modelgroup to call metadata | 2023-11-23 20:55:49 -08:00 |
| ishaan-jaff | db8ed601b5 | (test) temp comment router to test ci/cd | 2023-11-23 20:28:43 -08:00 |
| ishaan-jaff | 8a72487dcf | (test) proxy_server /chat/completions | 2023-11-23 20:28:06 -08:00 |
| Krrish Dholakia | 7d221fe863 | fix(utils.py): make failure logging sync | 2023-11-23 20:19:27 -08:00 |
| ishaan-jaff | 4816232674 | (test) cost tracker | 2023-11-23 18:28:37 -08:00 |
| ishaan-jaff | b6f8dbbc0e | (test) caching | 2023-11-23 18:17:12 -08:00 |
| Krrish Dholakia | 94dc3f66f3 | fix(utils.py): remove eos token for zephyr models | 2023-11-23 17:47:39 -08:00 |
| Krrish Dholakia | f24786095a | fix(vertex_ai.py): fix exception mapping for vertex ai | 2023-11-23 17:35:33 -08:00 |
| ishaan-jaff | 704af2ca34 | (test) testing fix | 2023-11-23 17:20:22 -08:00 |
| ishaan-jaff | 3118101e2c | (test) router: raises exception | 2023-11-23 17:09:36 -08:00 |
| ishaan-jaff | 3ee141e1e2 | (test) cleanup | 2023-11-23 17:09:36 -08:00 |
| Krrish Dholakia | abe2a009f9 | test(test_completion.py): fix azure read from env variable test | 2023-11-23 16:58:07 -08:00 |
| ishaan-jaff | 60dab0d700 | (test) router: azure pass key in model list | 2023-11-23 16:51:52 -08:00 |
| ishaan-jaff | f7a5b31b96 | (test) azure test passing key as completion param | 2023-11-23 16:51:52 -08:00 |
| Krrish Dholakia | c273d6f0d6 | fix(router.py): add support for context window fallbacks on router | 2023-11-23 16:43:02 -08:00 |
| ishaan-jaff | a1bb880872 | (test) router raise exceptions | 2023-11-23 16:28:19 -08:00 |
| Krrish Dholakia | afac42e93a | fix(router.py): enable async completions with model fallbacks | 2023-11-23 16:15:57 -08:00 |
| ishaan-jaff | 1365092c89 | (test) router: ensure errors are raised | 2023-11-23 16:08:59 -08:00 |
| ishaan-jaff | 525e01824a | (test) ensure exception raised when incorrect key passed | 2023-11-23 16:08:59 -08:00 |
| Krrish Dholakia | 8ac03e492f | fix(router.py): enable fallbacks for sync completions | 2023-11-23 16:06:46 -08:00 |
| ishaan-jaff | bb00595429 | (feat) cost: azure gpt + testing | 2023-11-23 14:20:48 -08:00 |
| ishaan-jaff | 90a9116a17 | (test) strict test ft:gpt-3.5 cost calc | 2023-11-23 14:11:01 -08:00 |
| ishaan-jaff | da7c07574f | (test) cost calc on azure | 2023-11-23 13:50:09 -08:00 |
| Krrish Dholakia | 4f183dc6a0 | fix(utils.py): support reading api keys dynamically from the os environment | 2023-11-23 13:41:56 -08:00 |
| ishaan-jaff | 2c50ea94c8 | (test) embedding: azurea - dd response_ms | 2023-11-23 11:15:52 -08:00 |
| Krrish Dholakia | 61fc76a8c4 | fix(router.py): fix caching for tracking cooldowns + usage | 2023-11-23 11:13:32 -08:00 |
| ishaan-jaff | fbd2ab4c71 | (test) caching ensure we always test add/get caching redis | 2023-11-23 08:27:44 -08:00 |
| ishaan-jaff | db146bc40a | (test) router with fallback deployments | 2023-11-22 20:52:56 -08:00 |
| ishaan-jaff | 8ebc1b974c | (chore) run ci/cd again | 2023-11-22 20:34:14 -08:00 |
| Krrish Dholakia | 5d5ca9f7ef | fix(router.py): add support for cooldowns with redis | 2023-11-22 19:54:22 -08:00 |
| Krrish Dholakia | 3e76d4b422 | feat(router.py): add server cooldown logic | 2023-11-22 15:59:48 -08:00 |
| ishaan-jaff | c38782521c | (test)timeout error on openai embedding | 2023-11-22 14:25:55 -08:00 |
| ishaan-jaff | 40e88eec4b | (test)timeout errors | 2023-11-22 14:25:55 -08:00 |
| ishaan-jaff | 3059f30672 | (test) verify azure response have expected keys | 2023-11-22 14:25:55 -08:00 |
| Krrish Dholakia | 0b4e10e068 | test(test_embedding.py): fix the embedding test | 2023-11-22 14:09:45 -08:00 |
| ishaan-jaff | 40dd38508f | (test) embedding stricter testing | 2023-11-22 13:50:45 -08:00 |
| Krrish Dholakia | b0801f61e6 | test(test_caching.py): cleaning up tests | 2023-11-22 13:43:48 -08:00 |
| Krrish Dholakia | 78582e158a | fix(main.py): fix acompletion for anyscale, openrouter, deepinfra, perplexity endpoints | 2023-11-22 13:22:58 -08:00 |
| Krrish Dholakia | 604ad41eac | fix(proxy_server.py): If master key is set, only master key can be used to generate new keys | 2023-11-22 10:18:28 -08:00 |