llama-stack-mirror/docs/source
Ashwin Bharambe 1463b79218
feat(registry): make the Stack query providers for model listing (#2862)
This flips #2823 and #2805 by making the Stack periodically query the
providers for models rather than the providers going behind the back and
calling "register" on to the registry themselves. This also adds support
for model listing for all other providers via `ModelRegistryHelper`.
Once this is done, we do not need to manually list or register models
via `run.yaml` and it will remove both noise and annoyance (setting
`INFERENCE_MODEL` environment variables, for example) from the new user
experience.

In addition, it adds a configuration variable `allowed_models` which can
be used to optionally restrict the set of models exposed from a
provider.
2025-07-24 10:39:53 -07:00
..
advanced_apis docs: Reorganize documentation on the webpage (#2651) 2025-07-15 14:19:35 -07:00
building_applications docs: Document use cases for Responses and Agents APIs (#2756) 2025-07-24 12:20:04 -04:00
concepts docs: update list of apis (#2697) 2025-07-24 09:50:14 -07:00
contributing chore: create OpenAIMixin for inference providers with an OpenAI-compat API that need to implement openai_* methods (#2835) 2025-07-23 06:49:40 -04:00
deploying chore: update k8s template (#2786) 2025-07-16 15:07:26 -07:00
distributions chore: add some documentation for access policy rules (#2785) 2025-07-23 10:27:27 +02:00
getting_started chore: Adding demo script and importing it into the docs (#2848) 2025-07-21 22:53:32 -04:00
providers feat(registry): make the Stack query providers for model listing (#2862) 2025-07-24 10:39:53 -07:00
references docs: update outdated llama stack client documentation (#2758) 2025-07-15 11:49:59 -07:00
conf.py docs: Reorganize documentation on the webpage (#2651) 2025-07-15 14:19:35 -07:00
index.md docs: Reorganize documentation on the webpage (#2651) 2025-07-15 14:19:35 -07:00