llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-06 18:40:57 +00:00

Matthew Farrellee e892a3f7f4 feat: add refresh_models support to inference adapters (default: false) (#3719)	2025-10-07 15:19:56 +02:00

# What does this PR do?

inference adapters can now configure `refresh_models: bool` to control periodic model listing from their providers

BREAKING CHANGE: together inference adapter default changed. previously always refreshed, now follows config.

addresses "models: refresh" on #3517

## Test Plan

ci w/ new tests

__init__.py	chore: turn OpenAIMixin into a pydantic.BaseModel (#3671)	2025-10-06 11:33:19 -04:00
config.py	feat: add refresh_models support to inference adapters (default: false) (#3719)	2025-10-07 15:19:56 +02:00
vllm.py	feat: add refresh_models support to inference adapters (default: false) (#3719)	2025-10-07 15:19:56 +02:00
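
The `refresh_models: bool` option named in the latest commit message can be pictured with a small sketch. This is not the code in `config.py` or `vllm.py`: `AdapterConfig`, `ModelRegistry`, `maybe_refresh`, and the `url` field are hypothetical names chosen for illustration, and the sketch uses a plain dataclass rather than the project's own config classes. The only detail taken from the commit message is that the flag is a boolean defaulting to false and that it controls whether models are re-listed from the provider.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class AdapterConfig:
    """Hypothetical stand-in for an inference adapter config (not the real class)."""

    url: str  # assumed field, for illustration only
    refresh_models: bool = False  # default stated in the commit message


class ModelRegistry:
    """Hypothetical holder that re-lists models only when the config allows it."""

    def __init__(self, config: AdapterConfig, list_models: Callable[[], List[str]]):
        self.config = config
        self._list_models = list_models
        # Assumption of this sketch: models are listed once at startup
        # regardless of the flag.
        self.models = list_models()

    def maybe_refresh(self) -> bool:
        """Re-list models if refresh_models is enabled; report whether it ran."""
        if not self.config.refresh_models:
            return False
        self.models = self._list_models()
        return True
```

A periodic task would call `maybe_refresh()` on a timer. With the default config the call is a no-op, which matches the "default: false" behavior described in the commit title.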