Commit graph

1706 commits

Author SHA1 Message Date
ishaan-jaff
160d8f9c55 (docs) test proxy 2023-11-17 10:19:12 -08:00
Krrish Dholakia
81764ca4f6 test: load test router 2023-11-17 08:23:44 -08:00
ishaan-jaff
eb18195424 (ci/cd) re run pipeline 2023-11-17 08:07:02 -08:00
Krrish Dholakia
1e0560e4d2 fix(utils.py): improve exception mapping for vertex ai 2023-11-16 22:02:26 -08:00
ishaan-jaff
0b051e56ba (ci/cd) re run pipeline 2023-11-16 21:55:10 -08:00
Krrish Dholakia
a753487d79 fix(router.py): check if async response is coroutine 2023-11-16 21:53:35 -08:00
Krrish Dholakia
d9123ea2e8 docs(routing.md): update tutorial on deploying router 2023-11-16 21:46:43 -08:00
Ishaan Jaff
c4f51594d5 (ci/cd) run again 2023-11-16 21:29:18 -08:00
Krrish Dholakia
81f6732fe5 fix(factory.py): for ollama models check if it's instruct or not before applying prompt template 2023-11-16 15:45:08 -08:00
ishaan-jaff
e5cb8013a1 (test) add --debug to cli tool 2023-11-16 14:46:26 -08:00
Krrish Dholakia
48a508bab6 feat: global client for sync + async calls (openai + Azure only) 2023-11-16 14:44:13 -08:00
Krrish Dholakia
7a71385e53 fix(openai.py): fix linting issues 2023-11-16 12:57:53 -08:00
Krrish Dholakia
a6e9f147d3 fix(openai.py): handling extra headers 2023-11-16 12:48:21 -08:00
ishaan-jaff
56838ee815 (fix) bedrock meta llama optional params 2023-11-16 12:38:27 -08:00
ishaan-jaff
7ba8271250 (linting) fix 2023-11-16 12:33:03 -08:00
ishaan-jaff
6645c510ac (fix) linting 2023-11-16 12:25:46 -08:00
ishaan-jaff
945761c36b (test) hf streaming 2023-11-16 12:24:31 -08:00
ishaan-jaff
55a054f3f6 (fix) only decode chunk when it's not a str 2023-11-16 12:24:31 -08:00
Krrish Dholakia
95642df0a4 test(loadtest_router.py): commenting out of ci/cd 2023-11-16 12:17:25 -08:00
Krrish Dholakia
9c2e74b6ea fix(azure.py): fix linting errors 2023-11-16 12:15:50 -08:00
Krrish Dholakia
e54056f0ed fix(azure.py): use openai client sdk for handling sync+async calling 2023-11-16 12:08:12 -08:00
ishaan-jaff
bef9ead3a6 (test) regular hf tests 2023-11-16 12:00:49 -08:00
ishaan-jaff
aa84ca04d8 (fix) HF api + streaming 2023-11-16 11:59:56 -08:00
ishaan-jaff
3b75dfe490 (fix) linting 2023-11-16 11:44:26 -08:00
ishaan-jaff
d951f849a7 (test) text_completion 2023-11-16 11:37:46 -08:00
ishaan-jaff
985f9b1ec1 (feat) text_completion add rules on when to use engine & model together 2023-11-16 11:37:31 -08:00
ishaan-jaff
521584377d (test) proxy cli 2023-11-16 11:19:09 -08:00
ishaan-jaff
f2170fa16b (test) proxy cli test 2023-11-16 11:13:39 -08:00
Krrish Dholakia
f14bd24b46 fix(openai.py): fix linting issues 2023-11-16 11:01:28 -08:00
ishaan-jaff
b2f9cb1387 (test) langfuse logger 2023-11-16 10:45:35 -08:00
ishaan-jaff
fb2d398d2c (fix) langfuse logging + openai streaming when chunk = [DONE] 2023-11-16 10:45:35 -08:00
Krrish Dholakia
9c7cc84eb0 fix(openai.py): supporting openai client sdk for handling sync + async calls (incl. for openai-compatible apis) 2023-11-16 10:35:03 -08:00
ishaan-jaff
900593355b (chore) linting fix 2023-11-16 09:57:40 -08:00
ishaan-jaff
082acfcc53 (feat) raise exception on proxy server 2023-11-16 09:51:17 -08:00
ishaan-jaff
f0c0e71208 (test) add proxy cli testing 2023-11-16 09:51:17 -08:00
Krrish Dholakia
838be2ed3f docs(simple_proxy.md): add tutorial on handling multiple instances of 1 model 2023-11-16 09:14:15 -08:00
Ishaan Jaff
da9a0ab928 Merge pull request #811 from dchristian3188/bedrock-llama: Bedrock llama 2023-11-16 07:57:50 -08:00
Ishaan Jaff
d6d0cbd63c Merge pull request #826 from rodneyxr/ollama-fixes: Fix typo for initial_prompt_value and too many values to unpack error 2023-11-16 07:55:53 -08:00
David Christian
461115330b updated utils for bedrock.meta streaming 2023-11-16 07:12:27 -08:00
Krrish Dholakia
a80c5841b2 fix(openai.py): using openai sdk for completion calls 2023-11-15 20:25:27 -08:00
Krrish Dholakia
6aef9d2db3 perf(openai.py): extending request timeout 2023-11-15 18:45:45 -08:00
Krrish Dholakia
41e1fcba96 fix(openai.py): fix linting errors 2023-11-15 18:33:01 -08:00
Krrish Dholakia
1c219106ba fix(openai.py): move chat completions post call to requests 2023-11-15 18:29:07 -08:00
ishaan-jaff
0adeb06306 (chore) testing reset callbacks 2023-11-15 18:29:03 -08:00
Krrish Dholakia
4bd471644e fix(openai.py): switch back to using requests instead of httpx 2023-11-15 18:25:21 -08:00
ishaan-jaff
0eb8ba7089 (chore) ci/cd testing 2023-11-15 18:07:04 -08:00
Krrish Dholakia
081400ecd4 fix(openai.py): fix timeout errors 2023-11-15 18:03:14 -08:00
ishaan-jaff
b397719709 (chore) remove bloat caching tests 2023-11-15 17:58:58 -08:00
Krrish Dholakia
f2020c2284 test: cleanup testing 2023-11-15 17:57:14 -08:00
ishaan-jaff
317104b555 (chore) speed up testing 2023-11-15 17:56:03 -08:00