llama-stack-mirror/llama_stack/providers/remote/inference
Matthew Farrellee bf63470c22 feat: implement dynamic model detection support for inference providers using litellm
This enhancement allows inference providers using LiteLLMOpenAIMixin to validate
model availability against LiteLLM's official provider model listings, improving
reliability and user experience when working with different AI service providers.

- Add litellm_provider_name parameter to LiteLLMOpenAIMixin constructor
- Add check_model_availability method to LiteLLMOpenAIMixin using litellm.models_by_provider (see the sketch after this entry)
- Update Gemini, Groq, and SambaNova inference adapters to pass litellm_provider_name
2025-07-24 09:49:32 -04:00
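
A minimal sketch of the two pieces named in the commit body, plus an adapter passing its provider key. Only litellm.models_by_provider and the names in the bullets come from the commit itself; the constructor signature, the fallback when a provider is missing from litellm's registry, and the GeminiInferenceAdapter shape are illustrative assumptions, not the repository's actual code.

    import litellm


    class LiteLLMOpenAIMixin:
        def __init__(self, litellm_provider_name: str):
            # Key used to look up this provider's models in litellm's
            # official listings, e.g. "gemini", "groq", or "sambanova".
            self.litellm_provider_name = litellm_provider_name

        async def check_model_availability(self, model: str) -> bool:
            # litellm.models_by_provider maps a provider key to the list
            # of model ids litellm knows that provider serves.
            if self.litellm_provider_name not in litellm.models_by_provider:
                # Assumed fallback: a provider absent from litellm's
                # registry cannot be validated, so report unavailable.
                return False
            return model in litellm.models_by_provider[self.litellm_provider_name]


    # Illustrative adapter update mirroring the Gemini/Groq/SambaNova
    # bullet: each adapter passes only its litellm provider key.
    class GeminiInferenceAdapter(LiteLLMOpenAIMixin):
        def __init__(self):
            super().__init__(litellm_provider_name="gemini")

Keying the availability check off a constructor parameter keeps it generic: any adapter built on the mixin gets model validation by supplying a single string.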
anthropic ci: test safety with starter (#2628) 2025-07-09 16:53:50 +02:00
bedrock ci: test safety with starter (#2628) 2025-07-09 16:53:50 +02:00
cerebras ci: test safety with starter (#2628) 2025-07-09 16:53:50 +02:00
databricks ci: test safety with starter (#2628) 2025-07-09 16:53:50 +02:00
fireworks ci: test safety with starter (#2628) 2025-07-09 16:53:50 +02:00
gemini feat: implement dynamic model detection support for inference providers using litellm 2025-07-24 09:49:32 -04:00
groq feat: implement dynamic model detection support for inference providers using litellm 2025-07-24 09:49:32 -04:00
llama_openai_compat chore: create OpenAIMixin for inference providers with an OpenAI-compat API that need to implement openai_* methods (#2835) 2025-07-23 06:49:40 -04:00
nvidia chore: create OpenAIMixin for inference providers with an OpenAI-compat API that need to implement openai_* methods (#2835) 2025-07-23 06:49:40 -04:00
ollama feat(vllm): periodically refresh models (#2823) 2025-07-18 15:53:09 -07:00
openai chore: create OpenAIMixin for inference providers with an OpenAI-compat API that need to implement openai_* methods (#2835) 2025-07-23 06:49:40 -04:00
passthrough feat: consolidate most distros into "starter" (#2516) 2025-07-04 15:58:03 +02:00
runpod ci: test safety with starter (#2628) 2025-07-09 16:53:50 +02:00
sambanova feat: implement dynamic model detection support for inference providers using litellm 2025-07-24 09:49:32 -04:00
tgi feat: consolidate most distros into "starter" (#2516) 2025-07-04 15:58:03 +02:00
together fix: Don't cache clients for passthrough auth providers (#2728) 2025-07-11 13:38:27 -07:00
vllm feat(vllm): periodically refresh models (#2823) 2025-07-18 15:53:09 -07:00
watsonx fix: allow default empty vars for conditionals (#2570) 2025-07-01 14:42:05 +02:00
__init__.py impls -> inline, adapters -> remote (#381) 2024-11-06 14:54:05 -08:00