llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-09 19:29:18 +00:00

Ashwin Bharambe 072d1b7205 Add completion() impl for meta-reference		2024-10-18 20:41:21 -07:00
quantization	Fix fp8 implementation which had bit-rotten a bit	2024-10-15 13:57:01 -07:00
__init__.py	Split off meta-reference-quantized provider	2024-10-10 16:03:19 -07:00
config.py	Allow overridding checkpoint_dir via config	2024-10-18 14:28:06 -07:00
generation.py	Add completion() impl for meta-reference	2024-10-18 20:41:21 -07:00
inference.py	Add completion() impl for meta-reference	2024-10-18 20:41:21 -07:00
model_parallel.py	Make all API methods `async def` again	2024-10-18 19:52:56 -07:00
parallel_utils.py	Make all API methods `async def` again	2024-10-18 19:52:56 -07:00