llama-stack-mirror/llama_toolchain/inference/meta_reference
2024-09-13 16:39:02 -07:00
..
__init__.py API Updates: fleshing out RAG APIs, introduce "llama stack" CLI command (#51) 2024-09-03 22:39:39 -07:00
config.py Nuke hardware_requirements from SKUs 2024-09-13 16:39:02 -07:00
generation.py Nuke hardware_requirements from SKUs 2024-09-13 16:39:02 -07:00
inference.py Remove request wrapper migration (#64) 2024-09-12 15:03:49 -07:00
model_parallel.py Nuke hardware_requirements from SKUs 2024-09-13 16:39:02 -07:00
parallel_utils.py Introduce Llama stack distributions (#22) 2024-08-08 13:38:41 -07:00