llama-stack-mirror/toolchain/inference/quantization
2024-07-19 12:30:35 -07:00
build_conda.sh Add toolchain from agentic system here 2024-07-19 12:30:35 -07:00
fp8_impls.py Add toolchain from agentic system here 2024-07-19 12:30:35 -07:00
fp8_requirements.txt Add toolchain from agentic system here 2024-07-19 12:30:35 -07:00
generation.py Add toolchain from agentic system here 2024-07-19 12:30:35 -07:00
model.py Add toolchain from agentic system here 2024-07-19 12:30:35 -07:00
quantize_checkpoint.py Add toolchain from agentic system here 2024-07-19 12:30:35 -07:00
run_quantize_checkpoint.sh Add toolchain from agentic system here 2024-07-19 12:30:35 -07:00
test_fp8.py Add toolchain from agentic system here 2024-07-19 12:30:35 -07:00