Commit graph

976 commits

Author SHA1 Message Date
ishaan-jaff
f026ed4772 (test) test seed, response format, for gpt-3.5-turbo 2023-11-17 14:00:42 -08:00
ishaan-jaff
bd82559553 (v1.0+ breaking change) get_max_tokens -> return int 2023-11-17 10:38:50 -08:00
Krrish Dholakia
3d45d8a58c test: load test router 2023-11-17 08:23:44 -08:00
ishaan-jaff
c7aba49d83 (ci/cd) re run pipeline 2023-11-17 08:07:02 -08:00
ishaan-jaff
3e2750ffb9 (ci/cd) re run pipeline 2023-11-16 21:55:10 -08:00
ishaan-jaff
25b2bc6da9 (test) add --debug to cli tool 2023-11-16 14:46:26 -08:00
Krrish Dholakia
51bf637656 feat: global client for sync + async calls (openai + Azure only) 2023-11-16 14:44:13 -08:00
Krrish Dholakia
a94c09c13c fix(openai.py): handling extra headers 2023-11-16 12:48:21 -08:00
ishaan-jaff
2dc411fdb3 (test) hf streaming 2023-11-16 12:24:31 -08:00
Krrish Dholakia
f582189cea test(loadtest_router.py): commenting out of ci/cd 2023-11-16 12:17:25 -08:00
Krrish Dholakia
bf0f8b824c fix(azure.py): use openai client sdk for handling sync+async calling 2023-11-16 12:08:12 -08:00
ishaan-jaff
3285113d2d (test) regular hf tests 2023-11-16 12:00:49 -08:00
ishaan-jaff
baf4e83738 (test) text_completion 2023-11-16 11:37:46 -08:00
ishaan-jaff
a1223e1f55 (test) proxy cli 2023-11-16 11:19:09 -08:00
ishaan-jaff
b607e5eb2a (test) proxy cli test 2023-11-16 11:13:39 -08:00
Krrish Dholakia
a23c0a2599 fix(openai.py): fix linting issues 2023-11-16 11:01:28 -08:00
ishaan-jaff
6b14c8d2de (test) langfuse logger 2023-11-16 10:45:35 -08:00
Krrish Dholakia
bb51216846 fix(openai.py): supporting openai client sdk for handling sync + async calls (incl. for openai-compatible apis) 2023-11-16 10:35:03 -08:00
ishaan-jaff
b283dd2a07 (test) add proxy cli testing 2023-11-16 09:51:17 -08:00
Krrish Dholakia
93aae8669d fix(openai.py): using openai sdk for completion calls 2023-11-15 20:25:27 -08:00
Krrish Dholakia
da68e1ea81 perf(openai.py): extending request timeout 2023-11-15 18:45:45 -08:00
ishaan-jaff
15439a6aa9 (chore) testing reset callbacks 2023-11-15 18:29:03 -08:00
Krrish Dholakia
fc8b54eb73 fix(openai.py): switch back to using requests instead of httpx 2023-11-15 18:25:21 -08:00
ishaan-jaff
5990d046c4 (chore) ci/cd testing 2023-11-15 18:07:04 -08:00
Krrish Dholakia
2750a26633 fix(openai.py): fix timeout errors 2023-11-15 18:03:14 -08:00
ishaan-jaff
3945bfadb7 (chore) remove bloat caching tests 2023-11-15 17:58:58 -08:00
Krrish Dholakia
a3d280baa3 test: cleanup testing 2023-11-15 17:57:14 -08:00
ishaan-jaff
a48445c11c (chore) speed up testing 2023-11-15 17:56:03 -08:00
ishaan-jaff
985220cc09 (chore) remove bloat 2023-11-15 17:52:57 -08:00
Krrish Dholakia
9fd7c5b343 test: set request timeout at request level 2023-11-15 17:42:31 -08:00
ishaan-jaff
eeec8a1765 (chore) remove bloat testing 2023-11-15 17:40:24 -08:00
ishaan-jaff
a7c31d54fa (fix) testing model alias map 2023-11-15 17:40:15 -08:00
ishaan-jaff
6f0b34c771 (chore) remvoe bloat testing 2023-11-15 17:33:15 -08:00
ishaan-jaff
bba0a2dcf7 (chore) remove bloat 2023-11-15 17:16:14 -08:00
Krrish Dholakia
03efc9185e fix(huggingface_restapi.py): async implementation 2023-11-15 16:54:15 -08:00
Krrish Dholakia
f84db3ce14 feat(get_max_tokens): get max tokens for huggingface hub models 2023-11-15 15:25:40 -08:00
Krrish Dholakia
1a705bfbcb refactor(huggingface_restapi.py): moving async completion + streaming to real async calls 2023-11-15 15:14:21 -08:00
Krrish Dholakia
c8b73d90f1 fix(utils.py): fix langfuse integration 2023-11-15 14:05:40 -08:00
Krrish Dholakia
8c104e9c6a fix(azure.py-+-proxy_server.py): fix function calling response object + support router on proxy 2023-11-15 13:15:16 -08:00
Krrish Dholakia
563a3d3016 refactor(router.py): renaming variable 2023-11-15 12:31:29 -08:00
Krrish Dholakia
03303033e5 feat(router.py): enable passing chat completion params for Router.chat.completion.create 2023-11-15 12:28:16 -08:00
Krrish Dholakia
f5abde775b test(test_completion.py): fix palm test 2023-11-14 22:28:56 -08:00
Krrish Dholakia
e07bf0a8de fix(utils.py): await async function in client wrapper 2023-11-14 22:07:28 -08:00
Krrish Dholakia
59eaeba92a fix(router.py): enabling retrying with expo backoff (without tenacity) for router 2023-11-14 20:57:51 -08:00
Krrish Dholakia
1738341dcb fix(main.py): misrouting ollama models to nlp cloud 2023-11-14 18:55:08 -08:00
Krrish Dholakia
a7222f257c fix(main.py): keep client consistent across calls + exponential backoff retry on ratelimit errors 2023-11-14 16:26:05 -08:00
ishaan-jaff
d80e2310ed (test) AI21 test 2023-11-14 15:01:50 -08:00
ishaan-jaff
59d217f373 (test) completion azure_ad_token 2023-11-14 12:48:31 -08:00
ishaan-jaff
dd50aeddbb (test) azure_ad_token 2023-11-14 12:12:14 -08:00
Krrish Dholakia
e9e86cac79 fix(palm.py): exception mapping bad requests / filtered responses 2023-11-14 11:53:13 -08:00