llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-07-27 06:28:50 +00:00

History

Matthew Farrellee bf63470c22 feat: implement dynamic model detection support for inference providers using litellm This enhancement allows inference providers using LiteLLMOpenAIMixin to validate model availability against LiteLLM's official provider model listings, improving reliability and user experience when working with different AI service providers. - Add litellm_provider_name parameter to LiteLLMOpenAIMixin constructor - Add check_model_availability method to LiteLLMOpenAIMixin using litellm.models_by_provider - Update Gemini, Groq, and SambaNova inference adapters to pass litellm_provider_name		2025-07-24 09:49:32 -04:00
..
__init__.py	chore: enable pyupgrade fixes (#1806 )	2025-05-01 14:23:50 -07:00
embedding_mixin.py	feat(registry): more flexible model lookup (#2859 )	2025-07-22 15:22:48 -07:00
inference_store.py	feat: support auth attributes in inference/responses stores (#2389 )	2025-06-20 10:24:45 -07:00
litellm_openai_mixin.py	feat: implement dynamic model detection support for inference providers using litellm	2025-07-24 09:49:32 -04:00
model_registry.py	chore: create OpenAIMixin for inference providers with an OpenAI-compat API that need to implement openai_* methods (#2835 )	2025-07-23 06:49:40 -04:00
openai_compat.py	chore: remove nested imports (#2515 )	2025-06-26 08:01:05 +05:30
openai_mixin.py	chore: create OpenAIMixin for inference providers with an OpenAI-compat API that need to implement openai_* methods (#2835 )	2025-07-23 06:49:40 -04:00
prompt_adapter.py	fix(ollama): Download remote image URLs for Ollama (#2551 )	2025-06-30 20:36:11 +05:30
stream_utils.py	feat: drop python 3.10 support (#2469 )	2025-06-19 12:07:14 +05:30