ishaan-jaff
|
f026ed4772
|
(test) test seed, response format, for gpt-3.5-turbo
|
2023-11-17 14:00:42 -08:00 |
|
ishaan-jaff
|
bd82559553
|
(v1.0+ breaking change) get_max_tokens -> return int
|
2023-11-17 10:38:50 -08:00 |
|
Krrish Dholakia
|
3d45d8a58c
|
test: load test router
|
2023-11-17 08:23:44 -08:00 |
|
ishaan-jaff
|
c7aba49d83
|
(ci/cd) re run pipeline
|
2023-11-17 08:07:02 -08:00 |
|
ishaan-jaff
|
3e2750ffb9
|
(ci/cd) re run pipeline
|
2023-11-16 21:55:10 -08:00 |
|
ishaan-jaff
|
25b2bc6da9
|
(test) add --debug to cli tool
|
2023-11-16 14:46:26 -08:00 |
|
Krrish Dholakia
|
51bf637656
|
feat: global client for sync + async calls (openai + Azure only)
|
2023-11-16 14:44:13 -08:00 |
|
Krrish Dholakia
|
a94c09c13c
|
fix(openai.py): handling extra headers
|
2023-11-16 12:48:21 -08:00 |
|
ishaan-jaff
|
2dc411fdb3
|
(test) hf streaming
|
2023-11-16 12:24:31 -08:00 |
|
Krrish Dholakia
|
f582189cea
|
test(loadtest_router.py): commenting out of ci/cd
|
2023-11-16 12:17:25 -08:00 |
|
Krrish Dholakia
|
bf0f8b824c
|
fix(azure.py): use openai client sdk for handling sync+async calling
|
2023-11-16 12:08:12 -08:00 |
|
ishaan-jaff
|
3285113d2d
|
(test) regular hf tests
|
2023-11-16 12:00:49 -08:00 |
|
ishaan-jaff
|
baf4e83738
|
(test) text_completion
|
2023-11-16 11:37:46 -08:00 |
|
ishaan-jaff
|
a1223e1f55
|
(test) proxy cli
|
2023-11-16 11:19:09 -08:00 |
|
ishaan-jaff
|
b607e5eb2a
|
(test) proxy cli test
|
2023-11-16 11:13:39 -08:00 |
|
Krrish Dholakia
|
a23c0a2599
|
fix(openai.py): fix linting issues
|
2023-11-16 11:01:28 -08:00 |
|
ishaan-jaff
|
6b14c8d2de
|
(test) langfuse logger
|
2023-11-16 10:45:35 -08:00 |
|
Krrish Dholakia
|
bb51216846
|
fix(openai.py): supporting openai client sdk for handling sync + async calls (incl. for openai-compatible apis)
|
2023-11-16 10:35:03 -08:00 |
|
ishaan-jaff
|
b283dd2a07
|
(test) add proxy cli testing
|
2023-11-16 09:51:17 -08:00 |
|
Krrish Dholakia
|
93aae8669d
|
fix(openai.py): using openai sdk for completion calls
|
2023-11-15 20:25:27 -08:00 |
|
Krrish Dholakia
|
da68e1ea81
|
perf(openai.py): extending request timeout
|
2023-11-15 18:45:45 -08:00 |
|
ishaan-jaff
|
15439a6aa9
|
(chore) testing reset callbacks
|
2023-11-15 18:29:03 -08:00 |
|
Krrish Dholakia
|
fc8b54eb73
|
fix(openai.py): switch back to using requests instead of httpx
|
2023-11-15 18:25:21 -08:00 |
|
ishaan-jaff
|
5990d046c4
|
(chore) ci/cd testing
|
2023-11-15 18:07:04 -08:00 |
|
Krrish Dholakia
|
2750a26633
|
fix(openai.py): fix timeout errors
|
2023-11-15 18:03:14 -08:00 |
|
ishaan-jaff
|
3945bfadb7
|
(chore) remove bloat caching tests
|
2023-11-15 17:58:58 -08:00 |
|
Krrish Dholakia
|
a3d280baa3
|
test: cleanup testing
|
2023-11-15 17:57:14 -08:00 |
|
ishaan-jaff
|
a48445c11c
|
(chore) speed up testing
|
2023-11-15 17:56:03 -08:00 |
|
ishaan-jaff
|
985220cc09
|
(chore) remove bloat
|
2023-11-15 17:52:57 -08:00 |
|
Krrish Dholakia
|
9fd7c5b343
|
test: set request timeout at request level
|
2023-11-15 17:42:31 -08:00 |
|
ishaan-jaff
|
eeec8a1765
|
(chore) remove bloat testing
|
2023-11-15 17:40:24 -08:00 |
|
ishaan-jaff
|
a7c31d54fa
|
(fix) testing model alias map
|
2023-11-15 17:40:15 -08:00 |
|
ishaan-jaff
|
6f0b34c771
|
(chore) remvoe bloat testing
|
2023-11-15 17:33:15 -08:00 |
|
ishaan-jaff
|
bba0a2dcf7
|
(chore) remove bloat
|
2023-11-15 17:16:14 -08:00 |
|
Krrish Dholakia
|
03efc9185e
|
fix(huggingface_restapi.py): async implementation
|
2023-11-15 16:54:15 -08:00 |
|
Krrish Dholakia
|
f84db3ce14
|
feat(get_max_tokens): get max tokens for huggingface hub models
|
2023-11-15 15:25:40 -08:00 |
|
Krrish Dholakia
|
1a705bfbcb
|
refactor(huggingface_restapi.py): moving async completion + streaming to real async calls
|
2023-11-15 15:14:21 -08:00 |
|
Krrish Dholakia
|
c8b73d90f1
|
fix(utils.py): fix langfuse integration
|
2023-11-15 14:05:40 -08:00 |
|
Krrish Dholakia
|
8c104e9c6a
|
fix(azure.py-+-proxy_server.py): fix function calling response object + support router on proxy
|
2023-11-15 13:15:16 -08:00 |
|
Krrish Dholakia
|
563a3d3016
|
refactor(router.py): renaming variable
|
2023-11-15 12:31:29 -08:00 |
|
Krrish Dholakia
|
03303033e5
|
feat(router.py): enable passing chat completion params for Router.chat.completion.create
|
2023-11-15 12:28:16 -08:00 |
|
Krrish Dholakia
|
f5abde775b
|
test(test_completion.py): fix palm test
|
2023-11-14 22:28:56 -08:00 |
|
Krrish Dholakia
|
e07bf0a8de
|
fix(utils.py): await async function in client wrapper
|
2023-11-14 22:07:28 -08:00 |
|
Krrish Dholakia
|
59eaeba92a
|
fix(router.py): enabling retrying with expo backoff (without tenacity) for router
|
2023-11-14 20:57:51 -08:00 |
|
Krrish Dholakia
|
1738341dcb
|
fix(main.py): misrouting ollama models to nlp cloud
|
2023-11-14 18:55:08 -08:00 |
|
Krrish Dholakia
|
a7222f257c
|
fix(main.py): keep client consistent across calls + exponential backoff retry on ratelimit errors
|
2023-11-14 16:26:05 -08:00 |
|
ishaan-jaff
|
d80e2310ed
|
(test) AI21 test
|
2023-11-14 15:01:50 -08:00 |
|
ishaan-jaff
|
59d217f373
|
(test) completion azure_ad_token
|
2023-11-14 12:48:31 -08:00 |
|
ishaan-jaff
|
dd50aeddbb
|
(test) azure_ad_token
|
2023-11-14 12:12:14 -08:00 |
|
Krrish Dholakia
|
e9e86cac79
|
fix(palm.py): exception mapping bad requests / filtered responses
|
2023-11-14 11:53:13 -08:00 |
|