llama-stack-mirror/llama_toolchain/inference
2024-08-24 23:36:58 -07:00
Name                 Last commit message                                                    Last updated
api                  agentic loop has a RAG implementation                                  2024-08-23 21:01:11 -07:00
meta_reference       Moved ToolPromptFormat and jinja templates to llama_models.llama3.api  2024-08-23 14:58:52 -07:00
ollama               use templates for generating system prompts                            2024-08-23 14:21:12 -07:00
quantization         llama_models.llama3_1 -> llama_models.llama3                           2024-08-19 10:55:37 -07:00
__init__.py          Initial commit                                                         2024-07-23 08:32:33 -07:00
client.py            re-work tool definitions, fix FastAPI issues, fix tool regressions     2024-08-24 22:35:56 -07:00
event_logger.py      formatting                                                             2024-08-14 17:03:43 -04:00
prepare_messages.py  basic RAG seems to work                                                2024-08-24 23:36:58 -07:00
providers.py         Introduce Llama stack distributions (#22)                              2024-08-08 13:38:41 -07:00