Change default embedding model to all-MiniLM-L6-v2

2025-12-07 18:57:21 +00:00 · 2025-08-08 16:41:17 -04:00 · 2025-08-08 16:41:17 -04:00 · aa5618a7c2
commit aa5618a7c2
parent e411099cbf
2 changed files with 4 additions and 8 deletions
--- a/docs/source/distributions/configuration.md
+++ b/docs/source/distributions/configuration.md
@ -818,7 +818,7 @@ Precedence rules at runtime:

 1. If `embedding_model` is explicitly passed in an API call, that value is used.
 2. Otherwise the value in `vector_store_config.default_embedding_model` is used.
-3. If neither is available the server will fall back to the system default (ibm-granite/granite-embedding-125m-english).
+3. If neither is available the server will fall back to the system default (all-MiniLM-L6-v2).

 #### Environment variables

@ -834,4 +834,4 @@ export LLAMA_STACK_DEFAULT_EMBEDDING_MODEL="sentence-transformers/all-MiniLM-L6-
 llama stack run --config run.yaml
 ```

-> Tip: If you omit `vector_store_config` entirely and don't set `LLAMA_STACK_DEFAULT_EMBEDDING_MODEL`, the system will fall back to the default `ibm-granite/granite-embedding-125m-english` model with 384 dimensions for vector store creation.
+> Tip: If you omit `vector_store_config` entirely and don't set `LLAMA_STACK_DEFAULT_EMBEDDING_MODEL`, the system will fall back to the default `all-MiniLM-L6-v2` model with 384 dimensions for vector store creation.