ishaan-jaff
|
160d8f9c55
|
(docs) test proxy
|
2023-11-17 10:19:12 -08:00 |
|
Krrish Dholakia
|
81764ca4f6
|
test: load test router
|
2023-11-17 08:23:44 -08:00 |
|
ishaan-jaff
|
eb18195424
|
(ci/cd) re run pipeline
|
2023-11-17 08:07:02 -08:00 |
|
Krrish Dholakia
|
1e0560e4d2
|
fix(utils.py): improve exception mapping for vertex ai
|
2023-11-16 22:02:26 -08:00 |
|
ishaan-jaff
|
0b051e56ba
|
(ci/cd) re run pipeline
|
2023-11-16 21:55:10 -08:00 |
|
Krrish Dholakia
|
a753487d79
|
fix(router.py): check if async response is coroutine
|
2023-11-16 21:53:35 -08:00 |
|
Krrish Dholakia
|
d9123ea2e8
|
docs(routing.md): update tutorial on deploying router
|
2023-11-16 21:46:43 -08:00 |
|
Ishaan Jaff
|
c4f51594d5
|
(ci/cd) run again
|
2023-11-16 21:29:18 -08:00 |
|
Krrish Dholakia
|
81f6732fe5
|
fix(factory.py): for ollama models check if it's instruct or not before applying prompt template
|
2023-11-16 15:45:08 -08:00 |
|
ishaan-jaff
|
e5cb8013a1
|
(test) add --debug to cli tool
|
2023-11-16 14:46:26 -08:00 |
|
Krrish Dholakia
|
48a508bab6
|
feat: global client for sync + async calls (openai + Azure only)
|
2023-11-16 14:44:13 -08:00 |
|
Krrish Dholakia
|
7a71385e53
|
fix(openai.py): fix linting issues
|
2023-11-16 12:57:53 -08:00 |
|
Krrish Dholakia
|
a6e9f147d3
|
fix(openai.py): handling extra headers
|
2023-11-16 12:48:21 -08:00 |
|
ishaan-jaff
|
56838ee815
|
(fix) bedrock meta llama optional params
|
2023-11-16 12:38:27 -08:00 |
|
ishaan-jaff
|
7ba8271250
|
(linting) fix
|
2023-11-16 12:33:03 -08:00 |
|
ishaan-jaff
|
6645c510ac
|
(fix) linting
|
2023-11-16 12:25:46 -08:00 |
|
ishaan-jaff
|
945761c36b
|
(test) hf streaming
|
2023-11-16 12:24:31 -08:00 |
|
ishaan-jaff
|
55a054f3f6
|
(fix) only decode chunk when it's not a str
|
2023-11-16 12:24:31 -08:00 |
|
Krrish Dholakia
|
95642df0a4
|
test(loadtest_router.py): commenting out of ci/cd
|
2023-11-16 12:17:25 -08:00 |
|
Krrish Dholakia
|
9c2e74b6ea
|
fix(azure.py): fix linting errors
|
2023-11-16 12:15:50 -08:00 |
|
Krrish Dholakia
|
e54056f0ed
|
fix(azure.py): use openai client sdk for handling sync+async calling
|
2023-11-16 12:08:12 -08:00 |
|
ishaan-jaff
|
bef9ead3a6
|
(test) regular hf tests
|
2023-11-16 12:00:49 -08:00 |
|
ishaan-jaff
|
aa84ca04d8
|
(fix) HF api + streaming
|
2023-11-16 11:59:56 -08:00 |
|
ishaan-jaff
|
3b75dfe490
|
(fix) linting
|
2023-11-16 11:44:26 -08:00 |
|
ishaan-jaff
|
d951f849a7
|
(test) text_completion
|
2023-11-16 11:37:46 -08:00 |
|
ishaan-jaff
|
985f9b1ec1
|
(feat) text_completion add rules on when to use engine & model together
|
2023-11-16 11:37:31 -08:00 |
|
ishaan-jaff
|
521584377d
|
(test) proxy cli
|
2023-11-16 11:19:09 -08:00 |
|
ishaan-jaff
|
f2170fa16b
|
(test) proxy cli test
|
2023-11-16 11:13:39 -08:00 |
|
Krrish Dholakia
|
f14bd24b46
|
fix(openai.py): fix linting issues
|
2023-11-16 11:01:28 -08:00 |
|
ishaan-jaff
|
b2f9cb1387
|
(test) langfuse logger
|
2023-11-16 10:45:35 -08:00 |
|
ishaan-jaff
|
fb2d398d2c
|
(fix) langfuse logging + openai streaming when chunk = [DONE}
|
2023-11-16 10:45:35 -08:00 |
|
Krrish Dholakia
|
9c7cc84eb0
|
fix(openai.py): supporting openai client sdk for handling sync + async calls (incl. for openai-compatible apis)
|
2023-11-16 10:35:03 -08:00 |
|
ishaan-jaff
|
900593355b
|
(chore) linting fix
|
2023-11-16 09:57:40 -08:00 |
|
ishaan-jaff
|
082acfcc53
|
(feat) raise exception on proxy server
|
2023-11-16 09:51:17 -08:00 |
|
ishaan-jaff
|
f0c0e71208
|
(test) add proxy cli testing
|
2023-11-16 09:51:17 -08:00 |
|
Krrish Dholakia
|
838be2ed3f
|
docs(simple_proxy.md): add tutorial on handling multiple instances of 1 model
|
2023-11-16 09:14:15 -08:00 |
|
Ishaan Jaff
|
da9a0ab928
|
Merge pull request #811 from dchristian3188/bedrock-llama
Bedrock llama
|
2023-11-16 07:57:50 -08:00 |
|
Ishaan Jaff
|
d6d0cbd63c
|
Merge pull request #826 from rodneyxr/ollama-fixes
Fix typo for initial_prompt_value and too many values to unpack error
|
2023-11-16 07:55:53 -08:00 |
|
David Christian
|
461115330b
|
updated utils for bedrock.meta streaming
|
2023-11-16 07:12:27 -08:00 |
|
Krrish Dholakia
|
a80c5841b2
|
fix(openai.py): using openai sdk for completion calls
|
2023-11-15 20:25:27 -08:00 |
|
Krrish Dholakia
|
6aef9d2db3
|
perf(openai.py): extending request timeout
|
2023-11-15 18:45:45 -08:00 |
|
Krrish Dholakia
|
41e1fcba96
|
fix(openai.py): fix linting errors
|
2023-11-15 18:33:01 -08:00 |
|
Krrish Dholakia
|
1c219106ba
|
fix(openai.py): move chat completions post call to requests
|
2023-11-15 18:29:07 -08:00 |
|
ishaan-jaff
|
0adeb06306
|
(chore) testing reset callbacks
|
2023-11-15 18:29:03 -08:00 |
|
Krrish Dholakia
|
4bd471644e
|
fix(openai.py): switch back to using requests instead of httpx
|
2023-11-15 18:25:21 -08:00 |
|
ishaan-jaff
|
0eb8ba7089
|
(chore) ci/cd testing
|
2023-11-15 18:07:04 -08:00 |
|
Krrish Dholakia
|
081400ecd4
|
fix(openai.py): fix timeout errors
|
2023-11-15 18:03:14 -08:00 |
|
ishaan-jaff
|
b397719709
|
(chore) remove bloat caching tests
|
2023-11-15 17:58:58 -08:00 |
|
Krrish Dholakia
|
f2020c2284
|
test: cleanup testing
|
2023-11-15 17:57:14 -08:00 |
|
ishaan-jaff
|
317104b555
|
(chore) speed up testing
|
2023-11-15 17:56:03 -08:00 |
|