llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-10-04 20:14:13 +00:00

History

skamenan7 17fbd21c0d feat(vector-io): implement global default embedding model configuration (Issue #2729 ) - Add VectorStoreConfig with global default_embedding_model and default_embedding_dimension - Support environment variables LLAMA_STACK_DEFAULT_EMBEDDING_MODEL and LLAMA_STACK_DEFAULT_EMBEDDING_DIMENSION - Implement precedence: explicit model > global default > clear error (no fallback) - Update VectorIORouter with _resolve_embedding_model() precedence logic - Remove non-deterministic 'first model in run.yaml' fallback behavior - Add vector_store_config to StackRunConfig and all distribution templates - Include comprehensive unit tests for config loading and router precedence - Update documentation with configuration examples and usage patterns - Fix error messages to include 'Failed to' prefix per coding standards Resolves deterministic vector store creation by eliminating unpredictable fallbacks and providing clear configuration options at the stack level.		2025-09-18 10:11:44 -04:00
..
advanced_apis	chore: remove absolute paths (#3263 )	2025-08-27 12:04:25 -07:00
apis	feat: introduce api leveling proposal (#3317 )	2025-09-16 18:18:36 +02:00
building_applications	chore: Updating documentation, adding exception handling for Vector Stores in RAG Tool, more tests on migration, and migrate off of inference_api for context_retriever for RAG (#3367 )	2025-09-11 14:20:11 +02:00
concepts	chore: remove absolute paths (#3263 )	2025-08-27 12:04:25 -07:00
contributing	chore: move benchmarking related code (#3406 )	2025-09-10 13:19:44 -07:00
deploying	chore(rename): move llama_stack.distribution to llama_stack.core (#2975 )	2025-07-30 23:30:53 -07:00
distributions	feat(vector-io): implement global default embedding model configuration (Issue #2729 )	2025-09-18 10:11:44 -04:00
getting_started	feat: Updating Rag Tool to use Files API and Vector Stores API (#3344 )	2025-09-06 07:26:34 -06:00
providers	chore: various watsonx fixes (#3428 )	2025-09-16 13:55:10 +02:00
references	fix: Remove bfcl scoring function as not supported (#3281 )	2025-08-29 11:03:52 -07:00
conf.py	docs: horizontal nav bar (#3407 )	2025-09-10 12:43:36 -07:00
index.md	docs: Reorganize documentation on the webpage (#2651 )	2025-07-15 14:19:35 -07:00