llama-stack/llama_toolchain/inference
2024-08-14 17:44:36 -07:00
..
api formatting 2024-08-14 17:03:43 -04:00
meta_reference Avoid using nearly double the memory needed (#30) 2024-08-14 17:44:36 -07:00
ollama formatting 2024-08-14 14:22:25 -04:00
quantization Introduce Llama stack distributions (#22) 2024-08-08 13:38:41 -07:00
__init__.py Initial commit 2024-07-23 08:32:33 -07:00
client.py Introduce Llama stack distributions (#22) 2024-08-08 13:38:41 -07:00
event_logger.py formatting 2024-08-14 17:03:43 -04:00
providers.py Introduce Llama stack distributions (#22) 2024-08-08 13:38:41 -07:00