llama-stack/llama_stack/providers/impls/meta_reference/inference/quantization
Xi Yan 07f9bf723f
fix broken --list-templates with adding build.yaml files for packaging (#327)
* add build files to templates

* fix templates

* manifest

* symlink

* symlink

* precommit

* change everything to docker build.yaml

* remove image_type in templates

* fix build from templates CLI

* fix readmes
2024-10-25 12:51:22 -07:00
scripts                 fix broken --list-templates with adding build.yaml files for packaging (#327)  2024-10-25 12:51:22 -07:00
__init__.py             API Updates (#73)  2024-09-17 19:51:35 -07:00
fp8_impls.py            API Updates (#73)  2024-09-17 19:51:35 -07:00
fp8_txest_disabled.py   Add a test runner and 2 very simple tests for agents  2024-09-19 12:22:48 -07:00
loader.py               New quantized models (#301)  2024-10-24 08:38:56 -07:00