llama-stack-mirror/llama_toolchain/inference
2024-08-08 10:45:32 -07:00
..
api Reduce a bunch of dependencies from toolchain 2024-08-07 21:55:07 -07:00
meta_reference for inline make 8b model the default 2024-08-08 10:45:32 -07:00
ollama update dependencies and rely on LLAMA_TOOLCHAIN_DIR for dev purposes 2024-08-08 08:22:13 -07:00
quantization Distribution server now functioning 2024-08-02 13:37:40 -07:00
__init__.py Initial commit 2024-07-23 08:32:33 -07:00
client.py minor fixes 2024-08-06 18:56:34 -07:00
event_logger.py Added Ollama as an inference impl (#20) 2024-07-31 22:08:37 -07:00
providers.py Remove additional_pip_packages; move deps to providers 2024-08-08 10:34:18 -07:00