llama-stack-mirror/llama_stack/providers/impls/meta_reference/inference
2024-10-10 16:03:19 -07:00
quantization       Split off meta-reference-quantized provider  2024-10-10 16:03:19 -07:00
__init__.py        Split off meta-reference-quantized provider  2024-10-10 16:03:19 -07:00
config.py          Split off meta-reference-quantized provider  2024-10-10 16:03:19 -07:00
generation.py      Split off meta-reference-quantized provider  2024-10-10 16:03:19 -07:00
inference.py       Split off meta-reference-quantized provider  2024-10-10 16:03:19 -07:00
model_parallel.py  Split off meta-reference-quantized provider  2024-10-10 16:03:19 -07:00
parallel_utils.py  Split off meta-reference-quantized provider  2024-10-10 16:03:19 -07:00