Mirror of https://github.com/meta-llama/llama-stack.git (synced 2025-10-25 01:01:13 +00:00)

# What does this PR do?
Refactors how the default vector store provider and embedding model are
set: both now come from an optional `vector_stores` config in
`StackRunConfig`. The supporting code was cleaned up accordingly (some
pieces of VectorDB had to be added back in). Also adds remote Qdrant and
Weaviate to the starter distro (following another PR where inference
providers were added for UX).

The new config is simply (default for the starter distro):
```yaml
vector_stores:
  default_provider_id: faiss
  default_embedding_model:
    provider_id: sentence-transformers
    model_id: nomic-ai/nomic-embed-text-v1.5
```
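
As a rough illustration of how an optional `vector_stores` section might be consumed, the sketch below models it with plain dataclasses and returns `None` when the section is absent. The names `QualifiedModel`, `VectorStoresConfig`, and `parse_vector_stores` are hypothetical and are not taken from the Llama Stack codebase; only the field names mirror the YAML above, and the run config is assumed to be already loaded into a dict.

```python
from dataclasses import dataclass
from typing import Any, Optional


@dataclass(frozen=True)
class QualifiedModel:
    """Hypothetical holder for the pair under `default_embedding_model`."""
    provider_id: str
    model_id: str


@dataclass(frozen=True)
class VectorStoresConfig:
    """Hypothetical mirror of the optional `vector_stores` section."""
    default_provider_id: Optional[str] = None
    default_embedding_model: Optional[QualifiedModel] = None


def parse_vector_stores(run_config: dict[str, Any]) -> Optional[VectorStoresConfig]:
    """Return the parsed section, or None when the run config omits it."""
    section = run_config.get("vector_stores")
    if section is None:
        return None
    model = section.get("default_embedding_model")
    return VectorStoresConfig(
        default_provider_id=section.get("default_provider_id"),
        default_embedding_model=QualifiedModel(**model) if model else None,
    )


# Starter-distro defaults from the YAML above, as an already-loaded dict.
starter = {
    "vector_stores": {
        "default_provider_id": "faiss",
        "default_embedding_model": {
            "provider_id": "sentence-transformers",
            "model_id": "nomic-ai/nomic-embed-text-v1.5",
        },
    }
}

config = parse_vector_stores(starter)          # section present
missing = parse_vector_stores({"version": 2})  # section omitted
```

Returning `None` for a missing section keeps the "optional" behaviour explicit: callers can tell "no defaults configured" apart from "defaults configured but partially empty".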
## Test Plan
CI and Unit tests.
---------
Signed-off-by: Francisco Javier Arceo <farceo@redhat.com>
Co-authored-by: Ashwin Bharambe <ashwin.bharambe@gmail.com>

60 lines, 1.9 KiB, YAML

version: 2
distribution_spec:
  description: Quick start template for running Llama Stack with several popular providers.
    This distribution is intended for CPU-only environments.
  providers:
    inference:
    - provider_type: remote::cerebras
    - provider_type: remote::ollama
    - provider_type: remote::vllm
    - provider_type: remote::tgi
    - provider_type: remote::fireworks
    - provider_type: remote::together
    - provider_type: remote::bedrock
    - provider_type: remote::nvidia
    - provider_type: remote::openai
    - provider_type: remote::anthropic
    - provider_type: remote::gemini
    - provider_type: remote::vertexai
    - provider_type: remote::groq
    - provider_type: remote::sambanova
    - provider_type: remote::azure
    - provider_type: inline::sentence-transformers
    vector_io:
    - provider_type: inline::faiss
    - provider_type: inline::sqlite-vec
    - provider_type: inline::milvus
    - provider_type: remote::chromadb
    - provider_type: remote::pgvector
    - provider_type: remote::qdrant
    - provider_type: remote::weaviate
    files:
    - provider_type: inline::localfs
    safety:
    - provider_type: inline::llama-guard
    - provider_type: inline::code-scanner
    agents:
    - provider_type: inline::meta-reference
    post_training:
    - provider_type: inline::torchtune-cpu
    eval:
    - provider_type: inline::meta-reference
    datasetio:
    - provider_type: remote::huggingface
    - provider_type: inline::localfs
    scoring:
    - provider_type: inline::basic
    - provider_type: inline::llm-as-judge
    - provider_type: inline::braintrust
    tool_runtime:
    - provider_type: remote::brave-search
    - provider_type: remote::tavily-search
    - provider_type: inline::rag-runtime
    - provider_type: remote::model-context-protocol
    batches:
    - provider_type: inline::reference
image_type: venv
additional_pip_packages:
- aiosqlite
- asyncpg
- sqlalchemy[asyncio]