forked from phoenix-oss/llama-stack-mirror
Just moving bits to a better place

## Test Plan

```bash
torchrun $CONDA_PREFIX/bin/pytest -s -v test_text_inference.py
```
| Name |
|---|
| `llama3` |
| `quantization` |
| `__init__.py` |
| `config.py` |
| `generation.py` |
| `inference.py` |
| `model_parallel.py` |
| `parallel_utils.py` |