llama-stack-mirror/llama_toolchain/inference
2024-08-08 08:22:13 -07:00
..
api Reduce a bunch of dependencies from toolchain 2024-08-07 21:55:07 -07:00
meta_reference Reduce a bunch of dependencies from toolchain 2024-08-07 21:55:07 -07:00
ollama update dependencies and rely on LLAMA_TOOLCHAIN_DIR for dev purposes 2024-08-08 08:22:13 -07:00
quantization Distribution server now functioning 2024-08-02 13:37:40 -07:00
__init__.py Initial commit 2024-07-23 08:32:33 -07:00
client.py minor fixes 2024-08-06 18:56:34 -07:00
event_logger.py Added Ollama as an inference impl (#20) 2024-07-31 22:08:37 -07:00
providers.py Make each inference provider into its own subdirectory 2024-08-05 16:39:58 -07:00