Commit graph

93 commits

Author SHA1 Message Date
Krrish Dholakia
2f69e254aa fix(factory.py): support 'add_generation_prompt' field for hf chat templates
Fixes https://github.com/BerriAI/litellm/pull/5178#issuecomment-2306362008
2024-08-23 08:06:21 -07:00
Krrish Dholakia
8637988bc0 fix(huggingface_restapi.py): support passing 'wait_for_model' param on completion calls 2024-08-09 09:25:19 -07:00
Krrish Dholakia
cac91dcae4 fix(huggingface_restapi.py): fix hf embeddings optional param processing 2024-08-09 09:10:56 -07:00
Krrish Dholakia
d382de7b74 fix(huggingface_restapi.py): fixes issue where 'wait_for_model' was not being passed as expected 2024-08-09 08:36:35 -07:00
Krrish Dholakia
cef1298d9d fix(huggingface_restapi.py): fix linting errors 2024-07-30 14:33:08 -07:00
Krrish Dholakia
17ac0f0636 feat(huggingface_restapi.py): Support multiple hf embedding types + async hf embeddings
Closes https://github.com/BerriAI/litellm/issues/3261
2024-07-30 13:32:03 -07:00
Krrish Dholakia
c69193c321 fix: move to using pydantic obj for setting values 2024-07-11 13:18:36 -07:00
Krrish Dholakia
4c88e917bd fix(huggingface_restapi.py): fix task extraction from model name 2024-05-15 07:28:19 -07:00
Krrish Dholakia
1b5a9a5c80 docs(huggingface.md): add text-classification to huggingface docs 2024-05-10 14:39:14 -07:00
Krrish Dholakia
4680f4e1db test(test_completion.py): reintegrate testing for huggingface tgi + non-tgi 2024-05-10 14:07:01 -07:00
Krrish Dholakia
ef72f25ab6 fix: fix linting errors 2024-05-09 17:55:27 -07:00
Krrish Dholakia
f660d21743 feat(predibase.py): add support for predibase provider
Closes https://github.com/BerriAI/litellm/issues/1253
2024-05-09 16:39:43 -07:00
Krrish Dholakia
f7c2f8313c fix(huggingface_restapi.py): fix hf streaming issue 2024-03-04 21:16:41 -08:00
Krrish Dholakia
a1ce24c5f0 fix(huggingface_restapi.py): fix huggingface streaming error raising 2024-03-04 09:32:41 -08:00
Krrish Dholakia
ed72005f52 fix(huggingface_restapi.py): return initial hf error 2024-02-24 10:46:59 -08:00
Krrish Dholakia
9c2cce56f7 fix(huggingface_restapi.py): return streamed response correctly 2024-02-16 13:25:13 -08:00
Krrish Dholakia
e5a444f69a fix(huggingface_restapi.py): fix hf streaming to raise exceptions 2024-02-15 21:25:12 -08:00
Krrish Dholakia
e4a5a3395c fix(huggingface_restapi.py): support timeouts for huggingface + openai text completions
https://github.com/BerriAI/litellm/issues/1334
2024-01-08 11:40:56 +05:30
Krrish Dholakia
79978c44ba refactor: add black formatting 2023-12-25 14:11:20 +05:30
ishaan-jaff
7a1e8b4bf3 (feat) show POST request for HF embeddings 2023-12-16 13:09:49 +05:30
Krrish Dholakia
726289cb49 fix(huggingface_restapi.py): add support for additional hf embedding formats 2023-12-15 21:02:41 -08:00
Krrish Dholakia
9ca06e2827 fix(huggingface_restapi.py): raise better exceptions for unprocessable hf responses 2023-12-05 07:28:21 -08:00
Ori Kotek
cc5dbbf53f Do not timeout when calling HF through acomplete 2023-11-23 15:56:59 +02:00
Krrish Dholakia
aad17a5479 fix(huggingface_restapi.pyu): fix linting errors 2023-11-21 10:05:35 -08:00
Krrish Dholakia
53d71c229f fix(huggingface_restapi.py): fixing formatting 2023-11-21 09:57:26 -08:00
Krrish Dholakia
c48e775908 fix(huggingface_restapi.py): fix huggingface response format 2023-11-21 09:57:26 -08:00
Krrish Dholakia
c425fb7eeb fix(huggingface_restapi.py): handle generate text output 2023-11-21 09:57:26 -08:00
ishaan-jaff
7bc28f3b1c (fix) pydantic errors with response.time 2023-11-20 18:28:19 -08:00
Krrish Dholakia
e28b9534e4 fix(huggingface_restapi.py): async implementation 2023-11-15 16:54:15 -08:00
Krrish Dholakia
b42cf80585 fix(utils): fixing exception mapping 2023-11-15 15:51:17 -08:00
Krrish Dholakia
6d1b8945dd fix(huggingface_restapi.py): fix linting errors 2023-11-15 15:34:21 -08:00
Krrish Dholakia
e35ce15a89 refactor(huggingface_restapi.py): moving async completion + streaming to real async calls 2023-11-15 15:14:21 -08:00
ishaan-jaff
84e931ce0f (feat) completion debug view HF POST request 2023-11-14 17:57:41 -08:00
Krrish Dholakia
4b74ddcb17 refactor: fixing linting issues 2023-11-11 18:52:28 -08:00
Krrish Dholakia
5efa3860da refactor(huggingface,-anthropic,-replicate,-sagemaker): making huggingface, anthropic, replicate, sagemaker compatible openai v1 sdk 2023-11-11 17:38:15 -08:00
Krrish Dholakia
7b4e21e574 refactor(bedrock.py-+-cohere.py): making bedrock and cohere compatible with openai v1 sdk 2023-11-11 17:33:19 -08:00
ishaan-jaff
edccfbb1da (fix) remove errant print from hf 2023-11-08 11:49:15 -08:00
ishaan-jaff
e7b6ef8f50 (fix) hf don't fail when logprob is None 2023-11-06 14:22:09 -08:00
Krrish Dholakia
9baafd7dbf fix(huggingface_restapi.py): output parsing chat template models 2023-11-06 11:43:12 -08:00
Krrish Dholakia
d53a0d9663 refactor(bedrock.py): better exception mapping for bedrock + huggingface 2023-11-04 16:12:12 -07:00
Krrish Dholakia
b1dcd0f1b9 fix(timeout.py): import errors 2023-11-04 16:05:14 -07:00
Krrish Dholakia
f7c5595a0d fix(main.py): fixing print_verbose 2023-11-04 14:41:34 -07:00
ishaan-jaff
d278417682 (fix) hf calculating usage non blocking 2023-11-03 18:03:19 -07:00
Krrish Dholakia
fa24a61976 refactor(proxy_server.py): print statement showing how to add debug for logs 2023-11-03 17:41:14 -07:00
ishaan-jaff
ee60c04251 (fix) remove errant print statements 2023-11-03 13:02:52 -07:00
Krrish Dholakia
33c1118080 feat(completion()): enable setting prompt templates via completion() 2023-11-02 16:24:01 -07:00
Krrish Dholakia
8475772daa fix(huggingface_restapi.py): fix linting issue 2023-11-01 16:43:35 -07:00
Krrish Dholakia
80cb421e02 fix(huggingface_restapi.py): fix embeddings for sentence-transformer models 2023-11-01 16:36:46 -07:00
ishaan-jaff
525e5476f6 (feat) textcompletion - transform hf log probs to openai text completion 2023-10-31 17:15:35 -07:00
ishaan-jaff
485a7ff136 (feat) use usage class for model responses for cohere, hf, tg ai, cohere 2023-10-27 09:58:47 -07:00