llama-stack-mirror/llama_stack/providers/utils
skamenan7 474b50b422 Add configurable embedding models for vector IO providers
This change lets users configure default embedding models at the provider level instead of always relying on system defaults. Each vector store provider can now specify an embedding_model and optional embedding_dimension in their config.

Key features:
- Auto-dimension lookup for standard models from the registry
- Support for Matryoshka embeddings with custom dimensions
- Three-tier priority: explicit params > provider config > system fallback
- Full backward compatibility - existing setups work unchanged
- Comprehensive test coverage with 20 test cases

Updated all vector IO providers (FAISS, Chroma, Milvus, Qdrant, etc.) with the new config fields and added detailed documentation with examples.

Fixes #2729
2025-07-15 16:46:40 -04:00
..
bedrock feat: drop python 3.10 support (#2469) 2025-06-19 12:07:14 +05:30
common chore: enable pyupgrade fixes (#1806) 2025-05-01 14:23:50 -07:00
datasetio chore(refact): move paginate_records fn outside of datasetio (#2137) 2025-05-12 10:56:14 -07:00
inference fix: address reviewer feedback - improve conditional imports and remove provider alias logic\n\n- Improve conditional import approach with better documentation\n- Remove provider-specific alias logic from sku_list.py\n- Conditional imports are necessary because llama4 requires torch\n- Addresses @ashwinb and @raghotham feedback while maintaining compatibility 2025-07-15 13:21:33 -04:00
kvstore fix: store configs (#2593) 2025-07-03 10:07:23 -07:00
memory fix: handle encoding errors when adding files to vector store (#2574) 2025-07-04 12:10:18 +02:00
responses fix: add missing argument and methods (#2550) 2025-06-30 14:55:37 +02:00
scoring chore: enable pyupgrade fixes (#1806) 2025-05-01 14:23:50 -07:00
sqlstore fix: auth sql store: user is owner policy (#2674) 2025-07-10 14:40:32 -07:00
telemetry chore: update pre-commit hook versions (#2708) 2025-07-10 16:47:59 +02:00
tools chore: bump python supported version to 3.12 (#2475) 2025-06-24 09:22:04 +05:30
vector_io Add configurable embedding models for vector IO providers 2025-07-15 16:46:40 -04:00
__init__.py API Updates (#73) 2024-09-17 19:51:35 -07:00
pagination.py chore(refact): move paginate_records fn outside of datasetio (#2137) 2025-05-12 10:56:14 -07:00
scheduler.py chore: bump python supported version to 3.12 (#2475) 2025-06-24 09:22:04 +05:30