llama-stack-mirror/llama_stack/providers/remote/inference/passthrough
Matthew Farrellee 466ef6f490 feat: add static embedding metadata to dynamic model listings for providers using OpenAIMixin
- remove auto-download of ollama embedding models
- add embedding model metadata to dynamic listing, with unit test
- add support and tests for allowed_models
- remove inference provider models.py files where dynamic listing is enabled
- store embedding metadata in embedding_model_metadata field on inference providers
- make model_entries optional on ModelRegistryHelper and LiteLLMOpenAIMixin
- make OpenAIMixin a ModelRegistryHelper
- skip base64 embedding test for remote::ollama, which always returns floats
- only use OpenAI client for ollama model listing
- remove unused build_model_entry function
- remove unused get_huggingface_repo function
2025-09-25 04:56:54 -04:00
__init__.py feat: inference passthrough provider (#1166) 2025-02-19 21:47:00 -08:00
config.py feat: consolidate most distros into "starter" (#2516) 2025-07-04 15:58:03 +02:00
passthrough.py feat: add static embedding metadata to dynamic model listings for providers using OpenAIMixin 2025-09-25 04:56:54 -04:00