Commit graph

71 commits

Author SHA1 Message Date
Ori Kotek
e74ac03169
Do not timeout when calling HF through acomplete 2023-11-23 15:56:59 +02:00
Krrish Dholakia
1218121e47 fix(huggingface_restapi.pyu): fix linting errors 2023-11-21 10:05:35 -08:00
Krrish Dholakia
846a32ca87 fix(huggingface_restapi.py): fixing formatting 2023-11-21 09:57:26 -08:00
Krrish Dholakia
6892fd8b51 fix(huggingface_restapi.py): fix huggingface response format 2023-11-21 09:57:26 -08:00
Krrish Dholakia
a89b8f55e3 fix(huggingface_restapi.py): handle generate text output 2023-11-21 09:57:26 -08:00
ishaan-jaff
50f883a2fb (fix) pydantic errors with response.time 2023-11-20 18:28:19 -08:00
Krrish Dholakia
03efc9185e fix(huggingface_restapi.py): async implementation 2023-11-15 16:54:15 -08:00
Krrish Dholakia
bcea28e2e4 fix(utils): fixing exception mapping 2023-11-15 15:51:17 -08:00
Krrish Dholakia
a59494571f fix(huggingface_restapi.py): fix linting errors 2023-11-15 15:34:21 -08:00
Krrish Dholakia
1a705bfbcb refactor(huggingface_restapi.py): moving async completion + streaming to real async calls 2023-11-15 15:14:21 -08:00
ishaan-jaff
f650be4fee (feat) completion debug view HF POST request 2023-11-14 17:57:41 -08:00
Krrish Dholakia
45b6f8b853 refactor: fixing linting issues 2023-11-11 18:52:28 -08:00
Krrish Dholakia
4f42beb9d9 refactor(huggingface,-anthropic,-replicate,-sagemaker): making huggingface, anthropic, replicate, sagemaker compatible openai v1 sdk 2023-11-11 17:38:15 -08:00
Krrish Dholakia
547598a134 refactor(bedrock.py-+-cohere.py): making bedrock and cohere compatible with openai v1 sdk 2023-11-11 17:33:19 -08:00
ishaan-jaff
a404b0fc3b (fix) remove errant print from hf 2023-11-08 11:49:15 -08:00
ishaan-jaff
3c67de7f04 (fix) hf don't fail when logprob is None 2023-11-06 14:22:09 -08:00
Krrish Dholakia
65c01eae23 fix(huggingface_restapi.py): output parsing chat template models 2023-11-06 11:43:12 -08:00
Krrish Dholakia
7c46e85ed6 refactor(bedrock.py): better exception mapping for bedrock + huggingface 2023-11-04 16:12:12 -07:00
Krrish Dholakia
ab54262d37 fix(timeout.py): import errors 2023-11-04 16:05:14 -07:00
Krrish Dholakia
5b3978eff4 fix(main.py): fixing print_verbose 2023-11-04 14:41:34 -07:00
ishaan-jaff
df57e9247a (fix) hf calculating usage non blocking 2023-11-03 18:03:19 -07:00
Krrish Dholakia
4e1885734a refactor(proxy_server.py): print statement showing how to add debug for logs 2023-11-03 17:41:14 -07:00
ishaan-jaff
6fc0c74878 (fix) remove errant print statements 2023-11-03 13:02:52 -07:00
Krrish Dholakia
512a1637eb feat(completion()): enable setting prompt templates via completion() 2023-11-02 16:24:01 -07:00
Krrish Dholakia
b305492a0b fix(huggingface_restapi.py): fix linting issue 2023-11-01 16:43:35 -07:00
Krrish Dholakia
2c4cb76ce5 fix(huggingface_restapi.py): fix embeddings for sentence-transformer models 2023-11-01 16:36:46 -07:00
ishaan-jaff
9223f7cc7a (feat) textcompletion - transform hf log probs to openai text completion 2023-10-31 17:15:35 -07:00
ishaan-jaff
63928fa166 (feat) use usage class for model responses for cohere, hf, tg ai, cohere 2023-10-27 09:58:47 -07:00
Krrish Dholakia
dcb866b353 docs(proxy_server.md): update proxy server docs to include multi-agent autogen tutorial 2023-10-17 09:22:34 -07:00
Krrish Dholakia
7572086231 style: fix linting errors 2023-10-16 17:35:08 -07:00
Krrish Dholakia
7358d2e4ea bump: version 0.8.4 → 0.8.5 2023-10-14 16:43:06 -07:00
ishaan-jaff
5a0284dd58 (fix) fix linting errors 2023-10-13 21:23:00 -07:00
Krrish Dholakia
af2fd0e0de fix: fix value error if model returns empty completion 2023-10-10 10:11:40 -07:00
Krrish Dholakia
dd7e397650 style(test_completion.py): fix merge conflict 2023-10-05 22:09:38 -07:00
Krrish Dholakia
512769e841 support n param for hf 2023-10-03 07:10:13 -07:00
Krrish Dholakia
5a19ee1a71 fix get optional params 2023-10-02 12:02:53 -07:00
Krrish Dholakia
1cae080eb2 raise exception if optional param is not mapped to model 2023-10-02 11:17:51 -07:00
ishaan-jaff
3fbad7dfa7 add hf embedding models 2023-09-29 11:57:38 -07:00
ishaan-jaff
156d4f27de auto-detect HF task 2023-09-27 17:49:31 -07:00
Krrish Dholakia
9ef0ad9e66 expose vertex ai and hf api base as env var 2023-09-22 15:14:33 -07:00
Krrish Dholakia
9ec8ea89bb fixing huggingface tgi index issue for best of 2023-09-21 10:05:22 -07:00
Krrish Dholakia
a8711dc5c2 fix hf tgi best of bug 2023-09-20 20:53:32 -07:00
Krrish Dholakia
c63db48652 return all best of sequences 2023-09-20 14:43:25 -07:00
Krrish Dholakia
5b294c704e fix hf conversational task bug 2023-09-18 11:46:36 -07:00
ishaan-jaff
0bee6e0d38 cleanup 2023-09-18 10:43:44 -07:00
ishaan-jaff
e7f4e8b4a4 allow non tgi llms 2023-09-18 10:26:57 -07:00
Krrish Dholakia
aaa57abddd map finish reason 2023-09-13 19:22:38 -07:00
Krrish Dholakia
5b6b9a9fab huggingface conversational task support 2023-09-13 13:45:23 -07:00
Krrish Dholakia
bab36c2c6f work for hf inference endpoint 2023-09-11 18:37:55 -07:00
Krrish Dholakia
4cfcabd919 adding support for vllm 2023-09-06 18:07:44 -07:00