llama-stack-mirror/llama_toolchain/inference/meta_reference
2024-08-08 10:45:32 -07:00
File               Last commit message                                      Last updated
__init__.py        Make each inference provider into its own subdirectory   2024-08-05 16:39:58 -07:00
config.py          for inline make 8b model the default                     2024-08-08 10:45:32 -07:00
generation.py      Reduce a bunch of dependencies from toolchain            2024-08-07 21:55:07 -07:00
inference.py       Make each inference provider into its own subdirectory   2024-08-05 16:39:58 -07:00
model_parallel.py  update inference config to take model and not model_dir  2024-08-06 15:02:47 -07:00
parallel_utils.py  Make each inference provider into its own subdirectory   2024-08-05 16:39:58 -07:00