feat: include all models from provider's /v1/models (#3471)
Matthew Farrellee · 521865c388 · 2025-09-18 05:17:11 -04:00
# What does this PR do?

This replaces the static model listing with a dynamic query of the provider's `/v1/models` endpoint for any provider built on `OpenAIMixin` (a minimal sketch follows the provider list below).

Currently, the providers using `OpenAIMixin` are:
 - anthropic
 - azure openai
 - gemini
 - groq
 - llama-api
 - nvidia
 - openai
 - sambanova
 - tgi
 - vertexai
 - vllm
 - not changed: together, which has its own model-listing implementation
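
For context, here is a minimal sketch of what dynamic listing looks like against an OpenAI-compatible async client. `list_provider_models` is an illustrative name, not the mixin's actual method; it only assumes openai-python's standard `client.models.list()` pagination.

```python
# Hedged sketch: collect every model the provider reports on /v1/models,
# instead of returning a hard-coded list. Assumes openai-python (v1.x).
from openai import AsyncOpenAI


async def list_provider_models(client: AsyncOpenAI) -> list[str]:
    # openai-python exposes /v1/models as an async-iterable page,
    # so new provider models show up without any code change.
    return [model.id async for model in client.models.list()]
```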

## Test Plan

 - new unit tests (a sketch of the behavior under test follows the shell loop below)
 - manual verification for llama-api, openai, groq, gemini:

```
for provider in llama-openai-compat openai groq gemini; do
   # start a stack for this provider in the background, then count its models
   uv run llama stack build --image-type venv --providers inference=remote::${provider} --run &
   uv run --with llama-stack-client llama-stack-client models list | grep Total
done
```
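
On the unit-test side, this is a minimal sketch of the behavior being checked, reusing the illustrative `list_provider_models` above with a stub client. None of these names are the repository's actual test code; the stub just mimics openai-python's async pagination, and the test requires pytest-asyncio.

```python
# Hedged test sketch: every model the stub /v1/models page reports
# must come back from the dynamic lister. Names are illustrative.
import pytest


class FakeModel:
    def __init__(self, model_id: str):
        self.id = model_id


class FakePage:
    """Async-iterable stand-in for openai-python's AsyncPage."""

    def __init__(self, models):
        self._models = models

    def __aiter__(self):
        self._it = iter(self._models)
        return self

    async def __anext__(self):
        try:
            return next(self._it)
        except StopIteration:
            raise StopAsyncIteration


class FakeModels:
    def __init__(self, models):
        self._models = models

    def list(self):
        return FakePage(self._models)


class FakeClient:
    def __init__(self, models):
        self.models = FakeModels(models)


@pytest.mark.asyncio
async def test_every_reported_model_is_listed():
    # duck-typed stub in place of AsyncOpenAI
    client = FakeClient([FakeModel("model-a"), FakeModel("model-b")])
    assert await list_provider_models(client) == ["model-a", "model-b"]
```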

Results (17 Sep 2025):
 - llama-api: 4
 - openai: 86
 - groq: 21
 - gemini: 66


closes #3467