llama-stack/llama_stack/providers/inline/inference/meta_reference/quantization/scripts
2025-03-01 12:48:08 -08:00
__init__.py                 Add provider deprecation support; change directory structure (#397)  2024-11-07 13:04:53 -08:00
quantize_checkpoint.py      chore: remove dependency on llama_models completely (#1344)          2025-03-01 12:48:08 -08:00
run_quantize_checkpoint.sh  Fix fp8 quantization script. (#500)                                  2024-11-21 09:15:28 -08:00