Krrish Dholakia
|
1812b7ae37
|
fix(proxy_server.py): fix linting errors
|
2023-11-18 17:35:16 -08:00 |
|
Krrish Dholakia
|
59b547a65e
|
docs(simple_proxy.md): adding token based auth to docs
|
2023-11-18 17:34:11 -08:00 |
|
ishaan-jaff
|
e276023462
|
(test) palm streaming test
|
2023-11-18 17:32:15 -08:00 |
|
ishaan-jaff
|
70dc8441f6
|
(fix) streaming ensure response obj is initialized
|
2023-11-18 17:31:58 -08:00 |
|
ishaan-jaff
|
9bea4044b0
|
(test) remove unused tests
|
2023-11-18 17:25:42 -08:00 |
|
ishaan-jaff
|
2f0304f100
|
(test) async for chunk
|
2023-11-18 17:17:39 -08:00 |
|
ishaan-jaff
|
903016ba2f
|
(fix) stream chunk builder when content == None in chunk
|
2023-11-18 17:16:05 -08:00 |
|
ishaan-jaff
|
30fcc1f9a0
|
(test) do nothing if token==None in stream chunk
|
2023-11-18 17:13:16 -08:00 |
|
ishaan-jaff
|
e527a45ffc
|
(feat) print_verbose Raw openai chunk
|
2023-11-18 17:12:49 -08:00 |
|
ishaan-jaff
|
2466d26301
|
(fix) linting
|
2023-11-18 17:05:46 -08:00 |
|
ishaan-jaff
|
b2f8f8ed45
|
(test) openai streaming + function calling
|
2023-11-18 17:01:46 -08:00 |
|
ishaan-jaff
|
8f402e04c9
|
(fix) streaming openai + function calling
|
2023-11-18 17:01:46 -08:00 |
|
Krrish Dholakia
|
6a1270ce5a
|
fix(proxy_server.py): handle initializing prisma / db connection just once
|
2023-11-18 16:45:19 -08:00 |
|
ishaan-jaff
|
e10c2d0bda
|
(test) tool/function calling + streaming
|
2023-11-18 16:23:29 -08:00 |
|
ishaan-jaff
|
edf98cabae
|
(fix) streaming + function / tool calling
|
2023-11-18 16:23:29 -08:00 |
|
Krrish Dholakia
|
8e599ed4f8
|
feat(proxy_server.py): enable token based authentication for server endpoints
|
2023-11-18 16:13:53 -08:00 |
|
Krrish Dholakia
|
229e5ea083
|
fix(proxy_server.py): allow user to connect their proxy to a postgres db
|
2023-11-18 15:57:38 -08:00 |
|
ishaan-jaff
|
c9445db22f
|
(fix) langfuse logging - dont fail when casting optional params
|
2023-11-18 15:36:12 -08:00 |
|
ishaan-jaff
|
b31368ac2f
|
(test) function calling
|
2023-11-18 15:15:02 -08:00 |
|
ishaan-jaff
|
250c891750
|
(fix) langfuse logger for function calling / tools
|
2023-11-18 15:14:44 -08:00 |
|
Krrish Dholakia
|
34decba3cf
|
fix(text_completion): allow either model or engine to be set
|
2023-11-17 18:25:21 -08:00 |
|
Krrish Dholakia
|
7ea80de2f1
|
test(test_async_fn.py): catch timeouts
|
2023-11-17 18:01:52 -08:00 |
|
Krrish Dholakia
|
cf0a9f591c
|
fix(router.py): introducing usage-based-routing
|
2023-11-17 17:56:26 -08:00 |
|
ishaan-jaff
|
259c1c7616
|
(ci/cd) fix timeout error
|
2023-11-17 17:46:49 -08:00 |
|
ishaan-jaff
|
505a0c354c
|
(docs) update readme proxy server
|
2023-11-17 17:40:44 -08:00 |
|
ishaan-jaff
|
bae94aced0
|
(docs) load balancer
|
2023-11-17 17:25:46 -08:00 |
|
ishaan-jaff
|
a9bb85489f
|
(docs) add example load balancer
|
2023-11-17 17:25:12 -08:00 |
|
Krrish Dholakia
|
452946b2f8
|
refactor(router.py): code cleanup
|
2023-11-17 17:05:46 -08:00 |
|
Krrish Dholakia
|
7d70bf84a7
|
test(test_langfuse.py): handle timeouts
|
2023-11-17 17:05:46 -08:00 |
|
ishaan-jaff
|
1ba32368ef
|
(test) parallel tool calling
|
2023-11-17 17:03:24 -08:00 |
|
Krrish Dholakia
|
29906f6320
|
test(test_completion.py): change tgai model
|
2023-11-17 16:06:42 -08:00 |
|
Krrish Dholakia
|
237e6a4c32
|
fix(test_router.py): catch timeouts
|
2023-11-17 15:56:06 -08:00 |
|
ishaan-jaff
|
6d3248e17f
|
(test) parallel function calling
|
2023-11-17 15:51:27 -08:00 |
|
ishaan-jaff
|
0ba90475c9
|
(feat) support parallel function calling
|
2023-11-17 15:51:27 -08:00 |
|
ishaan-jaff
|
b494990045
|
(feat) openai improve logging post_call
|
2023-11-17 15:51:27 -08:00 |
|
ishaan-jaff
|
698f47c226
|
(feat) improve logging - show model_call_details
|
2023-11-17 15:51:27 -08:00 |
|
Krrish Dholakia
|
aa6e6865cd
|
fix(main.py): fix linting issue
|
2023-11-17 15:45:00 -08:00 |
|
Krrish Dholakia
|
02ed97d0b2
|
fix(acompletion): support client side timeouts + raise exceptions correctly for async calls
|
2023-11-17 15:39:47 -08:00 |
|
ishaan-jaff
|
b3e36d0486
|
(test) test seed, response format, for gpt-3.5-turbo
|
2023-11-17 14:00:42 -08:00 |
|
ishaan-jaff
|
ef8d82a54c
|
(feat) completion: add response_format, seed, tools, tool_choice
|
2023-11-17 13:59:57 -08:00 |
|
ishaan-jaff
|
e9f6741b0b
|
(v1.0+ breaking change) get_max_tokens -> return int
|
2023-11-17 10:38:50 -08:00 |
|
ishaan-jaff
|
160d8f9c55
|
(docs) test proxy
|
2023-11-17 10:19:12 -08:00 |
|
Krrish Dholakia
|
81764ca4f6
|
test: load test router
|
2023-11-17 08:23:44 -08:00 |
|
ishaan-jaff
|
eb18195424
|
(ci/cd) re run pipeline
|
2023-11-17 08:07:02 -08:00 |
|
Krrish Dholakia
|
1e0560e4d2
|
fix(utils.py): improve exception mapping for vertex ai
|
2023-11-16 22:02:26 -08:00 |
|
ishaan-jaff
|
0b051e56ba
|
(ci/cd) re run pipeline
|
2023-11-16 21:55:10 -08:00 |
|
Krrish Dholakia
|
a753487d79
|
fix(router.py): check if async response is coroutine
|
2023-11-16 21:53:35 -08:00 |
|
Krrish Dholakia
|
d9123ea2e8
|
docs(routing.md): update tutorial on deploying router
|
2023-11-16 21:46:43 -08:00 |
|
Ishaan Jaff
|
c4f51594d5
|
(ci/cd) run again
|
2023-11-16 21:29:18 -08:00 |
|
Krrish Dholakia
|
81f6732fe5
|
fix(factory.py): for ollama models check if it's instruct or not before applying prompt template
|
2023-11-16 15:45:08 -08:00 |
|