llama-stack-mirror/llama_stack/templates
Ben Browning 8bf1d91d38 feat: Add synthetic-data-kit for file_search doc conversion
This adds a `builtin::document_conversion` tool, backed by
meta-llama/synthetic-data-kit, that converts documents for use with
file_search. I also have another local implementation that uses
Docling, but I still need to debug some segfaults I'm hitting locally
with it, so I'm pushing this one first as a simpler reference
implementation.

Long-term I think we'll want a remote implementation here as well,
perhaps docling-serve or unstructured.io, but I need to look into
that more.

This passes the existing
`tests/verifications/openai_api/test_responses.py` but doesn't yet add
any new tests for file types other than text and PDF.

Signed-off-by: Ben Browning <bbrownin@redhat.com>
2025-06-27 13:31:38 -04:00
bedrock refactor(env)!: enhanced environment variable substitution (#2490) 2025-06-26 08:20:08 +05:30
cerebras refactor(env)!: enhanced environment variable substitution (#2490) 2025-06-26 08:20:08 +05:30
ci-tests refactor(env)!: enhanced environment variable substitution (#2490) 2025-06-26 08:20:08 +05:30
dell refactor(env)!: enhanced environment variable substitution (#2490) 2025-06-26 08:20:08 +05:30
experimental-post-training fix: Some missed env variable changes from PR 2490 (#2538) 2025-06-26 17:59:15 -07:00
fireworks refactor(env)!: enhanced environment variable substitution (#2490) 2025-06-26 08:20:08 +05:30
groq refactor(env)!: enhanced environment variable substitution (#2490) 2025-06-26 08:20:08 +05:30
hf-endpoint refactor(env)!: enhanced environment variable substitution (#2490) 2025-06-26 08:20:08 +05:30
hf-serverless refactor(env)!: enhanced environment variable substitution (#2490) 2025-06-26 08:20:08 +05:30
llama_api refactor(env)!: enhanced environment variable substitution (#2490) 2025-06-26 08:20:08 +05:30
meta-reference-gpu fix: Some missed env variable changes from PR 2490 (#2538) 2025-06-26 17:59:15 -07:00
nvidia refactor(env)!: enhanced environment variable substitution (#2490) 2025-06-26 08:20:08 +05:30
ollama feat: Add synthetic-data-kit for file_search doc conversion 2025-06-27 13:31:38 -04:00
open-benchmark fix: Some missed env variable changes from PR 2490 (#2538) 2025-06-26 17:59:15 -07:00
passthrough refactor(env)!: enhanced environment variable substitution (#2490) 2025-06-26 08:20:08 +05:30
postgres-demo fix: Some missed env variable changes from PR 2490 (#2538) 2025-06-26 17:59:15 -07:00
remote-vllm fix: Some missed env variable changes from PR 2490 (#2538) 2025-06-26 17:59:15 -07:00
sambanova refactor(env)!: enhanced environment variable substitution (#2490) 2025-06-26 08:20:08 +05:30
starter feat: Add synthetic-data-kit for file_search doc conversion 2025-06-27 13:31:38 -04:00
tgi refactor(env)!: enhanced environment variable substitution (#2490) 2025-06-26 08:20:08 +05:30
together refactor(env)!: enhanced environment variable substitution (#2490) 2025-06-26 08:20:08 +05:30
vllm-gpu refactor(env)!: enhanced environment variable substitution (#2490) 2025-06-26 08:20:08 +05:30
watsonx refactor(env)!: enhanced environment variable substitution (#2490) 2025-06-26 08:20:08 +05:30
__init__.py Auto-generate distro yamls + docs (#468) 2024-11-18 14:57:06 -08:00
template.py chore: remove nested imports (#2515) 2025-06-26 08:01:05 +05:30