litellm-mirror/litellm/llms/openai/chat
Krish Dholakia 34bdf36eab
Add inference providers support for Hugging Face (#8258) (#9738) (#9773)
* Add inference providers support for Hugging Face (#8258)

* add first version of inference providers for huggingface

* temporarily skipping tests

* Add documentation

* Fix titles

* remove max_retries from params and clean up

* add suggestions

* use llm http handler

* update doc

* add suggestions

* run formatters

* add tests

* revert

* revert

* rename file

* set maxsize for lru cache

* fix embeddings

* fix inference url

* fix tests following breaking change in main

* use ChatCompletionRequest

* fix tests and lint

* [Hugging Face] Remove outdated chat completion tests and fix embedding tests (#9749)

* remove or fix tests

* fix link in doc

* fix(config_settings.md): document hf api key

---------

Co-authored-by: célina <hanouticelina@gmail.com>
2025-04-05 10:50:15 -07:00
..
gpt_audio_transformation.py (code quality) run ruff rule to ban unused imports (#7313) 2024-12-19 12:33:42 -08:00
gpt_transformation.py Add inference providers support for Hugging Face (#8258) (#9738) (#9773) 2025-04-05 10:50:15 -07:00
o_series_handler.py Improved O3 + Azure O3 support (#8181) 2025-02-01 09:52:28 -08:00
o_series_transformation.py build: merge branch 2025-03-02 08:31:57 -08:00