mirror of
https://github.com/meta-llama/llama-stack.git
synced 2025-07-24 21:29:53 +00:00
This wires up the Files API optional dependency into sqlite_vec and adds the localfs Files provider to our starter template, so that the Responses API file_search tool works out of the box for sqlite_vec in that template.

Additional testing with this provider and some other inference models led me to loosen the verification test's result checking a bit: not for the tool call itself, but for the assistant response that accompanies the file_search tool call. Some providers, such as OpenAI SaaS, sometimes make multiple tool calls to resolve the query, especially when they cannot find an answer and try a few permutations before returning empty results to the user in that test.

Signed-off-by: Ben Browning <bbrownin@redhat.com>
46 lines
1 KiB
YAML
version: '2'
distribution_spec:
  description: Quick start template for running Llama Stack with several popular providers
  providers:
    inference:
    - remote::openai
    - remote::fireworks
    - remote::together
    - remote::ollama
    - remote::anthropic
    - remote::gemini
    - remote::groq
    - remote::sambanova
    - remote::vllm
    - inline::sentence-transformers
    vector_io:
    - inline::sqlite-vec
    - remote::chromadb
    - remote::pgvector
    files:
    - inline::localfs
    safety:
    - inline::llama-guard
    agents:
    - inline::meta-reference
    telemetry:
    - inline::meta-reference
    eval:
    - inline::meta-reference
    datasetio:
    - remote::huggingface
    - inline::localfs
    scoring:
    - inline::basic
    - inline::llm-as-judge
    - inline::braintrust
    tool_runtime:
    - remote::brave-search
    - remote::tavily-search
    - inline::rag-runtime
    - remote::model-context-protocol
image_type: conda
additional_pip_packages:
- aiosqlite
- asyncpg
- sqlalchemy[asyncio]
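
The commit message says the file_search tool relies on sqlite_vec having a Files provider available in the template. As a rough illustration of that relationship, the sketch below checks a provider mapping shaped like the `providers` section above. The helper `missing_files_provider` and the `NEEDS_FILES` set are hypothetical and not part of llama-stack; the rule they encode is only an assumption drawn from the commit message.

```python
# Hypothetical consistency check; not part of llama-stack.
# Assumption (from the commit message): sqlite-vec wants a Files
# provider so that the Responses API file_search tool can work.
NEEDS_FILES = {"inline::sqlite-vec"}


def missing_files_provider(providers: dict[str, list[str]]) -> list[str]:
    """Return vector_io providers that want a Files provider when none is listed."""
    if providers.get("files"):
        # At least one Files provider is configured, so nothing is missing.
        return []
    return [p for p in providers.get("vector_io", []) if p in NEEDS_FILES]


# Subset of the starter template's providers, copied from the file above.
starter = {
    "vector_io": ["inline::sqlite-vec", "remote::chromadb", "remote::pgvector"],
    "files": ["inline::localfs"],
}

print(missing_files_provider(starter))  # prints []
```

Dropping the `files` entry from `starter` would make the helper report `inline::sqlite-vec`, which is the gap this change appears to close for the starter template.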