mirror of
https://github.com/meta-llama/llama-stack.git
synced 2025-12-27 08:52:00 +00:00
This adds a `builtin::document_conversion` tool, backed by meta-llama/synthetic-data-kit, for converting documents when used with file_search. I also have another local implementation that uses Docling, but I need to debug some segfaults I'm hitting locally with it, so I'm pushing this first as a simpler reference implementation. Long-term I think we'll want a remote implementation here as well, perhaps docling-serve or unstructured.io, but I need to look into that more. This passes the existing `tests/verifications/openai_api/test_responses.py` but doesn't yet add any new tests for file types besides text and PDF.
Signed-off-by: Ben Browning <bbrownin@redhat.com>
40 lines
895 B
YAML
version: 2
distribution_spec:
  description: Use (an external) Ollama server for running LLM inference
  providers:
    inference:
    - remote::ollama
    vector_io:
    - inline::faiss
    - remote::chromadb
    - remote::pgvector
    safety:
    - inline::llama-guard
    agents:
    - inline::meta-reference
    telemetry:
    - inline::meta-reference
    eval:
    - inline::meta-reference
    datasetio:
    - remote::huggingface
    - inline::localfs
    scoring:
    - inline::basic
    - inline::llm-as-judge
    - inline::braintrust
    files:
    - inline::localfs
    post_training:
    - inline::huggingface
    tool_runtime:
    - remote::brave-search
    - remote::tavily-search
    - inline::rag-runtime
    - inline::synthetic-data-kit
    - remote::model-context-protocol
    - remote::wolfram-alpha
image_type: conda
additional_pip_packages:
- aiosqlite
- sqlalchemy[asyncio]
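
Each provider entry in the spec above is written as a kind and a name joined by `::`, where the kind is either `inline` or `remote`. As a rough illustration of how the `tool_runtime` list divides between the two, here is a minimal Python sketch. The helper names `split_provider` and `group_by_kind` are hypothetical and are not part of llama-stack; the sketch only assumes the `kind::name` spelling visible in the file, and the list is copied by hand rather than parsed from YAML.

```python
from collections import defaultdict

# Entries copied by hand from the tool_runtime list in the build spec.
TOOL_RUNTIME = [
    "remote::brave-search",
    "remote::tavily-search",
    "inline::rag-runtime",
    "inline::synthetic-data-kit",
    "remote::model-context-protocol",
    "remote::wolfram-alpha",
]


def split_provider(entry: str) -> tuple[str, str]:
    """Split 'kind::name' into (kind, name). Hypothetical helper."""
    kind, sep, name = entry.partition("::")
    if not sep or kind not in ("inline", "remote") or not name:
        raise ValueError(f"unexpected provider entry: {entry!r}")
    return kind, name


def group_by_kind(entries: list[str]) -> dict[str, list[str]]:
    """Group provider names under their 'inline' or 'remote' prefix."""
    grouped = defaultdict(list)
    for entry in entries:
        kind, name = split_provider(entry)
        grouped[kind].append(name)
    return dict(grouped)


if __name__ == "__main__":
    for kind, names in group_by_kind(TOOL_RUNTIME).items():
        print(kind, names)
```

Grouped this way, the `synthetic-data-kit` provider added by this change lands in the `inline` group, which matches the commit message's description of it as a local reference implementation, with a remote one left for later.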