Commit graph

3 commits

| Author | SHA1 | Message | Date |
| --- | --- | --- | --- |
| Hardik Shah | f9111652ef | rename toolchain/ --> llama_toolchain/ | 2024-07-21 23:48:38 -07:00 |
| Ashwin Bharambe | ad62e2e1f3 | make inference server load checkpoints for fp8 inference; introduce quantization related args for inference config; also kill GeneratorArgs | 2024-07-20 22:54:48 -07:00 |
| Ashwin Bharambe | 95781ec85d | Add toolchain from agentic system here | 2024-07-19 12:30:35 -07:00 |