mirror of
https://github.com/meta-llama/llama-stack.git
synced 2025-10-21 16:07:16 +00:00
Kill the `builtin::rag` tool group completely since it is no longer targeted. We use the Responses implementation for knowledge_search which uses the `openai_vector_stores` pathway.

Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
28 lines
709 B
YAML
version: 2
distribution_spec:
  description: Use NVIDIA NIM for running LLM inference, evaluation and safety
  providers:
    inference:
    - provider_type: remote::nvidia
    vector_io:
    - provider_type: inline::faiss
    safety:
    - provider_type: remote::nvidia
    agents:
    - provider_type: inline::meta-reference
    eval:
    - provider_type: remote::nvidia
    post_training:
    - provider_type: remote::nvidia
    datasetio:
    - provider_type: inline::localfs
    - provider_type: remote::nvidia
    scoring:
    - provider_type: inline::basic
    tool_runtime: []
    files:
    - provider_type: inline::localfs
image_type: venv
additional_pip_packages:
- aiosqlite
- sqlalchemy[asyncio]