llama-stack-mirror/llama_toolchain/inference/meta_reference
Ashwin Bharambe f27d629fe8 Reduce a bunch of dependencies from toolchain
Some improvements to the distribution install script
2024-08-07 21:55:07 -07:00
..
__init__.py Make each inference provider into its own subdirectory 2024-08-05 16:39:58 -07:00
config.py Reduce a bunch of dependencies from toolchain 2024-08-07 21:55:07 -07:00
generation.py Reduce a bunch of dependencies from toolchain 2024-08-07 21:55:07 -07:00
inference.py Make each inference provider into its own subdirectory 2024-08-05 16:39:58 -07:00
model_parallel.py update inference config to take model and not model_dir 2024-08-06 15:02:47 -07:00
parallel_utils.py Make each inference provider into its own subdirectory 2024-08-05 16:39:58 -07:00