Krrish Dholakia
|
3ea776bdc0
|
fix(text_completion): allow either model or engine to be set
|
2023-11-17 18:25:21 -08:00 |
|
Krrish Dholakia
|
8e430fcfbe
|
test(test_async_fn.py): catch timeouts
|
2023-11-17 18:01:52 -08:00 |
|
Krrish Dholakia
|
478bd7def6
|
fix(router.py): introducing usage-based-routing
|
2023-11-17 17:56:26 -08:00 |
|
ishaan-jaff
|
a6a862dc5a
|
(ci/cd) fix timeout error
|
2023-11-17 17:46:49 -08:00 |
|
ishaan-jaff
|
e006cbbc73
|
(docs) update readme proxy server
|
2023-11-17 17:40:44 -08:00 |
|
ishaan-jaff
|
d1af0af7bf
|
(docs) load balancer
|
2023-11-17 17:25:46 -08:00 |
|
ishaan-jaff
|
42432bedaa
|
(docs) add example load balancer
|
2023-11-17 17:25:12 -08:00 |
|
Krrish Dholakia
|
7376e57e9c
|
refactor(router.py): code cleanup
|
2023-11-17 17:05:46 -08:00 |
|
Krrish Dholakia
|
5cddab9e54
|
test(test_langfuse.py): handle timeouts
|
2023-11-17 17:05:46 -08:00 |
|
ishaan-jaff
|
d2bac07b48
|
(test) parallel tool calling
|
2023-11-17 17:03:24 -08:00 |
|
Krrish Dholakia
|
985583023a
|
test(test_completion.py): change tgai model
|
2023-11-17 16:06:42 -08:00 |
|
Krrish Dholakia
|
a99efc544e
|
fix(test_router.py): catch timeouts
|
2023-11-17 15:56:06 -08:00 |
|
ishaan-jaff
|
bb9e7c65e9
|
(test) parallel function calling
|
2023-11-17 15:51:27 -08:00 |
|
ishaan-jaff
|
88200432b0
|
(feat) support parallel function calling
|
2023-11-17 15:51:27 -08:00 |
|
ishaan-jaff
|
32f22adf8b
|
(feat) openai improve logging post_call
|
2023-11-17 15:51:27 -08:00 |
|
ishaan-jaff
|
7de87c845b
|
(feat) improve logging - show model_call_details
|
2023-11-17 15:51:27 -08:00 |
|
Krrish Dholakia
|
17bb1184bd
|
fix(main.py): fix linting issue
|
2023-11-17 15:45:00 -08:00 |
|
Krrish Dholakia
|
0ab6b2451d
|
fix(acompletion): support client side timeouts + raise exceptions correctly for async calls
|
2023-11-17 15:39:47 -08:00 |
|
ishaan-jaff
|
f026ed4772
|
(test) test seed, response format, for gpt-3.5-turbo
|
2023-11-17 14:00:42 -08:00 |
|
ishaan-jaff
|
7abb65d53f
|
(feat) completion: add response_format, seed, tools, tool_choice
|
2023-11-17 13:59:57 -08:00 |
|
ishaan-jaff
|
bd82559553
|
(v1.0+ breaking change) get_max_tokens -> return int
|
2023-11-17 10:38:50 -08:00 |
|
ishaan-jaff
|
c162f8b4b0
|
(docs) test proxy
|
2023-11-17 10:19:12 -08:00 |
|
Krrish Dholakia
|
3d45d8a58c
|
test: load test router
|
2023-11-17 08:23:44 -08:00 |
|
ishaan-jaff
|
c7aba49d83
|
(ci/cd) re run pipeline
|
2023-11-17 08:07:02 -08:00 |
|
Krrish Dholakia
|
9bd1f4ebd0
|
fix(utils.py): improve exception mapping for vertex ai
|
2023-11-16 22:02:26 -08:00 |
|
ishaan-jaff
|
3e2750ffb9
|
(ci/cd) re run pipeline
|
2023-11-16 21:55:10 -08:00 |
|
Krrish Dholakia
|
75ef1d7eb4
|
fix(router.py): check if async response is coroutine
|
2023-11-16 21:53:35 -08:00 |
|
Krrish Dholakia
|
7456c26940
|
docs(routing.md): update tutorial on deploying router
|
2023-11-16 21:46:43 -08:00 |
|
Ishaan Jaff
|
61705f3467
|
(ci/cd) run again
|
2023-11-16 21:29:18 -08:00 |
|
Krrish Dholakia
|
7ef1014e59
|
fix(factory.py): for ollama models check if it's instruct or not before applying prompt template
|
2023-11-16 15:45:08 -08:00 |
|
ishaan-jaff
|
25b2bc6da9
|
(test) add --debug to cli tool
|
2023-11-16 14:46:26 -08:00 |
|
Krrish Dholakia
|
51bf637656
|
feat: global client for sync + async calls (openai + Azure only)
|
2023-11-16 14:44:13 -08:00 |
|
Krrish Dholakia
|
d7f7694848
|
fix(openai.py): fix linting issues
|
2023-11-16 12:57:53 -08:00 |
|
Krrish Dholakia
|
a94c09c13c
|
fix(openai.py): handling extra headers
|
2023-11-16 12:48:21 -08:00 |
|
ishaan-jaff
|
9e072f87bd
|
(fix) bedrock meta llama optional params
|
2023-11-16 12:38:27 -08:00 |
|
ishaan-jaff
|
23d560071b
|
(linting) fix
|
2023-11-16 12:33:03 -08:00 |
|
ishaan-jaff
|
f8af5e0155
|
(fix) linting
|
2023-11-16 12:25:46 -08:00 |
|
ishaan-jaff
|
2dc411fdb3
|
(test) hf streaming
|
2023-11-16 12:24:31 -08:00 |
|
ishaan-jaff
|
04971674b4
|
(fix) only decode chunk when it's not a str
|
2023-11-16 12:24:31 -08:00 |
|
Krrish Dholakia
|
f582189cea
|
test(loadtest_router.py): commenting out of ci/cd
|
2023-11-16 12:17:25 -08:00 |
|
Krrish Dholakia
|
f99a161d98
|
fix(azure.py): fix linting errors
|
2023-11-16 12:15:50 -08:00 |
|
Krrish Dholakia
|
bf0f8b824c
|
fix(azure.py): use openai client sdk for handling sync+async calling
|
2023-11-16 12:08:12 -08:00 |
|
ishaan-jaff
|
3285113d2d
|
(test) regular hf tests
|
2023-11-16 12:00:49 -08:00 |
|
ishaan-jaff
|
da8c2f4a4a
|
(fix) HF api + streaming
|
2023-11-16 11:59:56 -08:00 |
|
ishaan-jaff
|
a1cecbafe6
|
(fix) linting
|
2023-11-16 11:44:26 -08:00 |
|
ishaan-jaff
|
baf4e83738
|
(test) text_completion
|
2023-11-16 11:37:46 -08:00 |
|
ishaan-jaff
|
77468e0a70
|
(feat) text_completion add rules on when to use engine & model together
|
2023-11-16 11:37:31 -08:00 |
|
ishaan-jaff
|
a1223e1f55
|
(test) proxy cli
|
2023-11-16 11:19:09 -08:00 |
|
ishaan-jaff
|
b607e5eb2a
|
(test) proxy cli test
|
2023-11-16 11:13:39 -08:00 |
|
Krrish Dholakia
|
a23c0a2599
|
fix(openai.py): fix linting issues
|
2023-11-16 11:01:28 -08:00 |
|