llama-stack-mirror/llama_toolchain/inference
2024-09-10 18:22:09 +02:00
..
adapters Improve TGI adapter initialization condition 2024-09-10 18:22:09 +02:00
api API Updates: fleshing out RAG APIs, introduce "llama stack" CLI command (#51) 2024-09-03 22:39:39 -07:00
meta_reference API Updates: fleshing out RAG APIs, introduce "llama stack" CLI command (#51) 2024-09-03 22:39:39 -07:00
quantization API Updates: fleshing out RAG APIs, introduce "llama stack" CLI command (#51) 2024-09-03 22:39:39 -07:00
__init__.py Initial commit 2024-07-23 08:32:33 -07:00
client.py API Updates: fleshing out RAG APIs, introduce "llama stack" CLI command (#51) 2024-09-03 22:39:39 -07:00
event_logger.py formatting 2024-08-14 17:03:43 -04:00
prepare_messages.py API Updates: fleshing out RAG APIs, introduce "llama stack" CLI command (#51) 2024-09-03 22:39:39 -07:00
providers.py Use huggingface_hub inference client for TGI inference 2024-09-05 18:29:04 +02:00