llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-20 11:18:41 +00:00

History

Matthew Farrellee 466ef6f490 feat: add static embedding metadata to dynamic model listings for providers using OpenAIMixin - remove auto-download of ollama embedding models - add embedding model metadata to dynamic listing w/ unit test - add support and tests for allowed_models - removed inference provider models.py files where dynamic listing is enabled - store embedding metadata in embedding_model_metadata field on inference providers - make model_entries optional on ModelRegistryHelper and LiteLLMOpenAIMixin - make OpenAIMixin a ModelRegistryHelper - skip base64 embedding test for remote::ollama, always returns floats - only use OpenAI client for ollama model listing - remove unused build_model_entry function - remove unused get_huggingface_repo function		2025-09-25 04:56:54 -04:00
..
inference	feat: add static embedding metadata to dynamic model listings for providers using OpenAIMixin	2025-09-25 04:56:54 -04:00
memory	chore: Updating documentation, adding exception handling for Vector Stores in RAG Tool, more tests on migration, and migrate off of inference_api for context_retriever for RAG (#3367 )	2025-09-11 14:20:11 +02:00
__init__.py	fix: add check for interleavedContent (#1973 )	2025-05-06 09:55:07 -07:00
test_model_registry.py	feat: add static embedding metadata to dynamic model listings for providers using OpenAIMixin	2025-09-25 04:56:54 -04:00
test_scheduler.py	chore: default to pytest asyncio-mode=auto (#2730 )	2025-07-11 13:00:24 -07:00