forked from phoenix-oss/llama-stack-mirror
* Added hadamard transform for spinquant * Changed from config to model_args * Added an assertion for model args * Use enum.value to check against str * pre-commit --------- Co-authored-by: Sachin Mehta <sacmehta@fb.com> Co-authored-by: Ashwin Bharambe <ashwin.bharambe@gmail.com> |
||
|---|---|---|
| .. | ||
| quantization | ||
| __init__.py | ||
| config.py | ||
| generation.py | ||
| inference.py | ||
| model_parallel.py | ||
| parallel_utils.py | ||