llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-16 19:29:27 +00:00

History

Matthew Farrellee e892a3f7f4 feat: add refresh_models support to inference adapters (default: false) (#3719 ) # What does this PR do? inference adapters can now configure `refresh_models: bool` to control periodic model listing from their providers BREAKING CHANGE: together inference adapter default changed. previously always refreshed, now follows config. addresses "models: refresh" on #3517 ## Test Plan ci w/ new tests		2025-10-07 15:19:56 +02:00
..
bedrock	fix: use lambda pattern for bedrock config env vars (#3307 )	2025-09-05 10:45:11 +02:00
test_inference_client_caching.py	chore: turn OpenAIMixin into a pydantic.BaseModel (#3671 )	2025-10-06 11:33:19 -04:00
test_litellm_openai_mixin.py	feat: add static embedding metadata to dynamic model listings for providers using OpenAIMixin (#3547 )	2025-09-25 17:17:00 -04:00
test_openai_base_url_config.py	chore: turn OpenAIMixin into a pydantic.BaseModel (#3671 )	2025-10-06 11:33:19 -04:00
test_remote_vllm.py	feat: add refresh_models support to inference adapters (default: false) (#3719 )	2025-10-07 15:19:56 +02:00