Commit graph

1677 commits

Author SHA1 Message Date
Krrish Dholakia
3ea776bdc0 fix(text_completion): allow either model or engine to be set 2023-11-17 18:25:21 -08:00
Krrish Dholakia
8e430fcfbe test(test_async_fn.py): catch timeouts 2023-11-17 18:01:52 -08:00
Krrish Dholakia
478bd7def6 fix(router.py): introducing usage-based-routing 2023-11-17 17:56:26 -08:00
ishaan-jaff
a6a862dc5a (ci/cd) fix timeout error 2023-11-17 17:46:49 -08:00
ishaan-jaff
e006cbbc73 (docs) update readme proxy server 2023-11-17 17:40:44 -08:00
ishaan-jaff
d1af0af7bf (docs) load balancer 2023-11-17 17:25:46 -08:00
ishaan-jaff
42432bedaa (docs) add example load balancer 2023-11-17 17:25:12 -08:00
Krrish Dholakia
7376e57e9c refactor(router.py): code cleanup 2023-11-17 17:05:46 -08:00
Krrish Dholakia
5cddab9e54 test(test_langfuse.py): handle timeouts 2023-11-17 17:05:46 -08:00
ishaan-jaff
d2bac07b48 (test) parallel tool calling 2023-11-17 17:03:24 -08:00
Krrish Dholakia
985583023a test(test_completion.py): change tgai model 2023-11-17 16:06:42 -08:00
Krrish Dholakia
a99efc544e fix(test_router.py): catch timeouts 2023-11-17 15:56:06 -08:00
ishaan-jaff
bb9e7c65e9 (test) parallel function calling 2023-11-17 15:51:27 -08:00
ishaan-jaff
88200432b0 (feat) support parallel function calling 2023-11-17 15:51:27 -08:00
ishaan-jaff
32f22adf8b (feat) openai improve logging post_call 2023-11-17 15:51:27 -08:00
ishaan-jaff
7de87c845b (feat) improve logging - show model_call_details 2023-11-17 15:51:27 -08:00
Krrish Dholakia
17bb1184bd fix(main.py): fix linting issue 2023-11-17 15:45:00 -08:00
Krrish Dholakia
0ab6b2451d fix(acompletion): support client side timeouts + raise exceptions correctly for async calls 2023-11-17 15:39:47 -08:00
ishaan-jaff
f026ed4772 (test) test seed, response format, for gpt-3.5-turbo 2023-11-17 14:00:42 -08:00
ishaan-jaff
7abb65d53f (feat) completion: add response_format, seed, tools, tool_choice 2023-11-17 13:59:57 -08:00
ishaan-jaff
bd82559553 (v1.0+ breaking change) get_max_tokens -> return int 2023-11-17 10:38:50 -08:00
ishaan-jaff
c162f8b4b0 (docs) test proxy 2023-11-17 10:19:12 -08:00
Krrish Dholakia
3d45d8a58c test: load test router 2023-11-17 08:23:44 -08:00
ishaan-jaff
c7aba49d83 (ci/cd) re run pipeline 2023-11-17 08:07:02 -08:00
Krrish Dholakia
9bd1f4ebd0 fix(utils.py): improve exception mapping for vertex ai 2023-11-16 22:02:26 -08:00
ishaan-jaff
3e2750ffb9 (ci/cd) re run pipeline 2023-11-16 21:55:10 -08:00
Krrish Dholakia
75ef1d7eb4 fix(router.py): check if async response is coroutine 2023-11-16 21:53:35 -08:00
Krrish Dholakia
7456c26940 docs(routing.md): update tutorial on deploying router 2023-11-16 21:46:43 -08:00
Ishaan Jaff
61705f3467
(ci/cd) run again 2023-11-16 21:29:18 -08:00
Krrish Dholakia
7ef1014e59 fix(factory.py): for ollama models check if it's instruct or not before applying prompt template 2023-11-16 15:45:08 -08:00
ishaan-jaff
25b2bc6da9 (test) add --debug to cli tool 2023-11-16 14:46:26 -08:00
Krrish Dholakia
51bf637656 feat: global client for sync + async calls (openai + Azure only) 2023-11-16 14:44:13 -08:00
Krrish Dholakia
d7f7694848 fix(openai.py): fix linting issues 2023-11-16 12:57:53 -08:00
Krrish Dholakia
a94c09c13c fix(openai.py): handling extra headers 2023-11-16 12:48:21 -08:00
ishaan-jaff
9e072f87bd (fix) bedrock meta llama optional params 2023-11-16 12:38:27 -08:00
ishaan-jaff
23d560071b (linting) fix 2023-11-16 12:33:03 -08:00
ishaan-jaff
f8af5e0155 (fix) linting 2023-11-16 12:25:46 -08:00
ishaan-jaff
2dc411fdb3 (test) hf streaming 2023-11-16 12:24:31 -08:00
ishaan-jaff
04971674b4 (fix) only decode chunk when it's not a str 2023-11-16 12:24:31 -08:00
Krrish Dholakia
f582189cea test(loadtest_router.py): commenting out of ci/cd 2023-11-16 12:17:25 -08:00
Krrish Dholakia
f99a161d98 fix(azure.py): fix linting errors 2023-11-16 12:15:50 -08:00
Krrish Dholakia
bf0f8b824c fix(azure.py): use openai client sdk for handling sync+async calling 2023-11-16 12:08:12 -08:00
ishaan-jaff
3285113d2d (test) regular hf tests 2023-11-16 12:00:49 -08:00
ishaan-jaff
da8c2f4a4a (fix) HF api + streaming 2023-11-16 11:59:56 -08:00
ishaan-jaff
a1cecbafe6 (fix) linting 2023-11-16 11:44:26 -08:00
ishaan-jaff
baf4e83738 (test) text_completion 2023-11-16 11:37:46 -08:00
ishaan-jaff
77468e0a70 (feat) text_completion add rules on when to use engine & model together 2023-11-16 11:37:31 -08:00
ishaan-jaff
a1223e1f55 (test) proxy cli 2023-11-16 11:19:09 -08:00
ishaan-jaff
b607e5eb2a (test) proxy cli test 2023-11-16 11:13:39 -08:00
Krrish Dholakia
a23c0a2599 fix(openai.py): fix linting issues 2023-11-16 11:01:28 -08:00