llama-stack-mirror/llama_toolchain/inference
2024-09-11 16:05:35 -07:00
..
adapters [inference] Add a TGI adapter (#52) 2024-09-04 22:49:33 -07:00
api fix api to work with openapi generator 2024-09-11 16:05:35 -07:00
meta_reference fix inference 2024-09-11 15:15:16 -07:00
quantization API Updates: fleshing out RAG APIs, introduce "llama stack" CLI command (#51) 2024-09-03 22:39:39 -07:00
__init__.py Initial commit 2024-07-23 08:32:33 -07:00
client.py [1/n] migrate inference/chat_completion 2024-09-11 12:21:19 -07:00
event_logger.py formatting 2024-08-14 17:03:43 -04:00
prepare_messages.py API Updates: fleshing out RAG APIs, introduce "llama stack" CLI command (#51) 2024-09-03 22:39:39 -07:00
providers.py Simplified Telemetry API and tying it to logger (#57) 2024-09-11 15:37:05 -07:00