Commit graph

1697 commits

Author SHA1 Message Date
Krrish Dholakia
1812b7ae37 fix(proxy_server.py): fix linting errors 2023-11-18 17:35:16 -08:00
Krrish Dholakia
59b547a65e docs(simple_proxy.md): adding token based auth to docs 2023-11-18 17:34:11 -08:00
ishaan-jaff
e276023462 (test) palm streaming test 2023-11-18 17:32:15 -08:00
ishaan-jaff
70dc8441f6 (fix) streaming ensure response obj is initialized 2023-11-18 17:31:58 -08:00
ishaan-jaff
9bea4044b0 (test) remove unused tests 2023-11-18 17:25:42 -08:00
ishaan-jaff
2f0304f100 (test) async for chunk 2023-11-18 17:17:39 -08:00
ishaan-jaff
903016ba2f (fix) stream chunk builder when content == None in chunk 2023-11-18 17:16:05 -08:00
ishaan-jaff
30fcc1f9a0 (test) do nothing if token==None in stream chunk 2023-11-18 17:13:16 -08:00
ishaan-jaff
e527a45ffc (feat) print_verbose Raw openai chunk 2023-11-18 17:12:49 -08:00
ishaan-jaff
2466d26301 (fix) linting 2023-11-18 17:05:46 -08:00
ishaan-jaff
b2f8f8ed45 (test) openai streaming + function calling 2023-11-18 17:01:46 -08:00
ishaan-jaff
8f402e04c9 (fix) streaming openai + function calling 2023-11-18 17:01:46 -08:00
Krrish Dholakia
6a1270ce5a fix(proxy_server.py): handle initializing prisma / db connection just once 2023-11-18 16:45:19 -08:00
ishaan-jaff
e10c2d0bda (test) tool/function calling + streaming 2023-11-18 16:23:29 -08:00
ishaan-jaff
edf98cabae (fix) streaming + function / tool calling 2023-11-18 16:23:29 -08:00
Krrish Dholakia
8e599ed4f8 feat(proxy_server.py): enable token based authentication for server endpoints 2023-11-18 16:13:53 -08:00
Krrish Dholakia
229e5ea083 fix(proxy_server.py): allow user to connect their proxy to a postgres db 2023-11-18 15:57:38 -08:00
ishaan-jaff
c9445db22f (fix) langfuse logging - dont fail when casting optional params 2023-11-18 15:36:12 -08:00
ishaan-jaff
b31368ac2f (test) function calling 2023-11-18 15:15:02 -08:00
ishaan-jaff
250c891750 (fix) langfuse logger for function calling / tools 2023-11-18 15:14:44 -08:00
Krrish Dholakia
34decba3cf fix(text_completion): allow either model or engine to be set 2023-11-17 18:25:21 -08:00
Krrish Dholakia
7ea80de2f1 test(test_async_fn.py): catch timeouts 2023-11-17 18:01:52 -08:00
Krrish Dholakia
cf0a9f591c fix(router.py): introducing usage-based-routing 2023-11-17 17:56:26 -08:00
ishaan-jaff
259c1c7616 (ci/cd) fix timeout error 2023-11-17 17:46:49 -08:00
ishaan-jaff
505a0c354c (docs) update readme proxy server 2023-11-17 17:40:44 -08:00
ishaan-jaff
bae94aced0 (docs) load balancer 2023-11-17 17:25:46 -08:00
ishaan-jaff
a9bb85489f (docs) add example load balancer 2023-11-17 17:25:12 -08:00
Krrish Dholakia
452946b2f8 refactor(router.py): code cleanup 2023-11-17 17:05:46 -08:00
Krrish Dholakia
7d70bf84a7 test(test_langfuse.py): handle timeouts 2023-11-17 17:05:46 -08:00
ishaan-jaff
1ba32368ef (test) parallel tool calling 2023-11-17 17:03:24 -08:00
Krrish Dholakia
29906f6320 test(test_completion.py): change tgai model 2023-11-17 16:06:42 -08:00
Krrish Dholakia
237e6a4c32 fix(test_router.py): catch timeouts 2023-11-17 15:56:06 -08:00
ishaan-jaff
6d3248e17f (test) parallel function calling 2023-11-17 15:51:27 -08:00
ishaan-jaff
0ba90475c9 (feat) support parallel function calling 2023-11-17 15:51:27 -08:00
ishaan-jaff
b494990045 (feat) openai improve logging post_call 2023-11-17 15:51:27 -08:00
ishaan-jaff
698f47c226 (feat) improve logging - show model_call_details 2023-11-17 15:51:27 -08:00
Krrish Dholakia
aa6e6865cd fix(main.py): fix linting issue 2023-11-17 15:45:00 -08:00
Krrish Dholakia
02ed97d0b2 fix(acompletion): support client side timeouts + raise exceptions correctly for async calls 2023-11-17 15:39:47 -08:00
ishaan-jaff
b3e36d0486 (test) test seed, response format, for gpt-3.5-turbo 2023-11-17 14:00:42 -08:00
ishaan-jaff
ef8d82a54c (feat) completion: add response_format, seed, tools, tool_choice 2023-11-17 13:59:57 -08:00
ishaan-jaff
e9f6741b0b (v1.0+ breaking change) get_max_tokens -> return int 2023-11-17 10:38:50 -08:00
ishaan-jaff
160d8f9c55 (docs) test proxy 2023-11-17 10:19:12 -08:00
Krrish Dholakia
81764ca4f6 test: load test router 2023-11-17 08:23:44 -08:00
ishaan-jaff
eb18195424 (ci/cd) re run pipeline 2023-11-17 08:07:02 -08:00
Krrish Dholakia
1e0560e4d2 fix(utils.py): improve exception mapping for vertex ai 2023-11-16 22:02:26 -08:00
ishaan-jaff
0b051e56ba (ci/cd) re run pipeline 2023-11-16 21:55:10 -08:00
Krrish Dholakia
a753487d79 fix(router.py): check if async response is coroutine 2023-11-16 21:53:35 -08:00
Krrish Dholakia
d9123ea2e8 docs(routing.md): update tutorial on deploying router 2023-11-16 21:46:43 -08:00
Ishaan Jaff
c4f51594d5 (ci/cd) run again 2023-11-16 21:29:18 -08:00
Krrish Dholakia
81f6732fe5 fix(factory.py): for ollama models check if it's instruct or not before applying prompt template 2023-11-16 15:45:08 -08:00