llama-stack/llama_toolchain
Celina Hanouti 736092f6bc
[Inference] Use huggingface_hub inference client for TGI adapter (#53)
* Use huggingface_hub inference client for TGI inference

* Update the default value for TGI URL

* Use InferenceClient.text_generation for TGI inference

* Post-review fixes; split TGI adapter into local and Inference Endpoints adapters

* Update CLI reference and add typing

* Rename TGI Adapter class

* Use HfApi to get the namespace when not provided in the HF endpoint name

* Remove unnecessary method argument

* Improve TGI adapter initialization condition

* Move helper into impl file + fix merge conflicts
2024-09-12 09:11:35 -07:00
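
The commit above routes TGI inference through huggingface_hub's InferenceClient.text_generation and resolves a namespace for Inference Endpoints names. A minimal sketch of that shape follows; it is not the adapter's actual code. The helper names (make_tgi_client, complete, split_endpoint_name) and the default_namespace parameter are hypothetical, chosen only for illustration.

```python
from typing import Any, Optional, Tuple


def make_tgi_client(url: str, token: Optional[str] = None) -> Any:
    """Build a huggingface_hub InferenceClient pointed at a TGI server URL."""
    # Imported lazily so the helpers below can be used without the package.
    from huggingface_hub import InferenceClient

    return InferenceClient(model=url, token=token)


def complete(client: Any, prompt: str, max_new_tokens: int = 64) -> str:
    """Return the generated text for one prompt (non-streaming)."""
    # With the default details=False and stream=False, text_generation
    # returns the generated text as a plain string.
    return client.text_generation(prompt, max_new_tokens=max_new_tokens)


def split_endpoint_name(name: str, default_namespace: str) -> Tuple[str, str]:
    """Split 'namespace/endpoint' into its parts (hypothetical helper).

    When no namespace is given, fall back to default_namespace.
    """
    if "/" in name:
        namespace, endpoint = name.split("/", 1)
        return namespace, endpoint
    return default_namespace, name
```

For a locally running TGI server, usage might look like complete(make_tgi_client("http://localhost:8080"), "Hello"); the URL here is an assumption, not the default this commit sets. The fallback namespace could plausibly come from HfApi().whoami()["name"], though how the adapter obtains it is not shown in this listing.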
agentic_system Simplified Telemetry API and tying it to logger (#57) 2024-09-11 14:25:37 -07:00
batch_inference API Updates: fleshing out RAG APIs, introduce "llama stack" CLI command (#51) 2024-09-03 22:39:39 -07:00
cli Add config file based CLI (#60) 2024-09-11 11:39:46 -07:00
common Enable Bing search (#59) 2024-09-10 12:34:29 -07:00
core [Inference] Use huggingface_hub inference client for TGI adapter (#53) 2024-09-12 09:11:35 -07:00
dataset/api API Updates: fleshing out RAG APIs, introduce "llama stack" CLI command (#51) 2024-09-03 22:39:39 -07:00
evaluations/api API Updates: fleshing out RAG APIs, introduce "llama stack" CLI command (#51) 2024-09-03 22:39:39 -07:00
inference [Inference] Use huggingface_hub inference client for TGI adapter (#53) 2024-09-12 09:11:35 -07:00
memory Simplified Telemetry API and tying it to logger (#57) 2024-09-11 14:25:37 -07:00
models/api formatting 2024-08-14 17:03:43 -04:00
post_training/api API Updates: fleshing out RAG APIs, introduce "llama stack" CLI command (#51) 2024-09-03 22:39:39 -07:00
reward_scoring/api API Updates: fleshing out RAG APIs, introduce "llama stack" CLI command (#51) 2024-09-03 22:39:39 -07:00
safety Simplified Telemetry API and tying it to logger (#57) 2024-09-11 14:25:37 -07:00
synthetic_data_generation/api API Updates: fleshing out RAG APIs, introduce "llama stack" CLI command (#51) 2024-09-03 22:39:39 -07:00
telemetry Simplified Telemetry API and tying it to logger (#57) 2024-09-11 14:25:37 -07:00
tools Enable Bing search (#59) 2024-09-10 12:34:29 -07:00
__init__.py Initial commit 2024-07-23 08:32:33 -07:00
stack.py add safety to openapi spec (#62) 2024-09-10 17:47:13 -07:00