llama-stack-mirror/llama_stack/providers/impls
2024-10-25 11:48:24 -07:00
..
ios/inference Update iOS inference instructions for new quantization 2024-10-24 14:47:27 -04:00
meta_reference Added hadamard transform for spinquant 2024-10-25 11:48:24 -07:00
vllm Make vllm inference better 2024-10-24 22:52:47 -07:00
__init__.py API Updates (#73) 2024-09-17 19:51:35 -07:00