llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-23 02:52:25 +00:00

History

skamenan7 474b50b422 Add configurable embedding models for vector IO providers This change lets users configure default embedding models at the provider level instead of always relying on system defaults. Each vector store provider can now specify an embedding_model and optional embedding_dimension in their config. Key features: - Auto-dimension lookup for standard models from the registry - Support for Matryoshka embeddings with custom dimensions - Three-tier priority: explicit params > provider config > system fallback - Full backward compatibility - existing setups work unchanged - Comprehensive test coverage with 20 test cases Updated all vector IO providers (FAISS, Chroma, Milvus, Qdrant, etc.) with the new config fields and added detailed documentation with examples. Fixes #2729		2025-07-15 16:46:40 -04:00
..
agents	fix: store configs (#2593 )	2025-07-03 10:07:23 -07:00
datasetio	fix: store configs (#2593 )	2025-07-03 10:07:23 -07:00
eval	fix: store configs (#2593 )	2025-07-03 10:07:23 -07:00
files	docs: auto generated documentation for providers (#2543 )	2025-06-30 15:13:20 +02:00
inference	feat: consolidate most distros into "starter" (#2516 )	2025-07-04 15:58:03 +02:00
post_training	feat: consolidate most distros into "starter" (#2516 )	2025-07-04 15:58:03 +02:00
safety	docs: auto generated documentation for providers (#2543 )	2025-06-30 15:13:20 +02:00
scoring	fix: allow default empty vars for conditionals (#2570 )	2025-07-01 14:42:05 +02:00
telemetry	feat: improve telemetry (#2590 )	2025-07-04 17:29:09 +02:00
tool_runtime	fix: allow default empty vars for conditionals (#2570 )	2025-07-01 14:42:05 +02:00
vector_io	Add configurable embedding models for vector IO providers	2025-07-15 16:46:40 -04:00
external.md	docs: update external provider guide and navigation (#2567 )	2025-07-01 09:42:32 +02:00
index.md	docs: update full list of providers with matched APIs and dockerhub images (#2452 )	2025-07-03 10:12:56 +02:00