llama-stack-mirror/llama_toolchain/inference/meta_reference
2024-08-14 17:09:24 -07:00
__init__.py        Introduce Llama stack distributions (#22)    2024-08-08 13:38:41 -07:00
config.py          upgrade pydantic to latest                   2024-08-12 15:14:21 -07:00
generation.py      Avoid using nearly double the memory needed  2024-08-14 17:09:24 -07:00
inference.py       formatting                                   2024-08-14 14:22:25 -04:00
model_parallel.py  Introduce Llama stack distributions (#22)    2024-08-08 13:38:41 -07:00
parallel_utils.py  Introduce Llama stack distributions (#22)    2024-08-08 13:38:41 -07:00