llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-23 07:59:44 +00:00

Ashwin Bharambe 1463b79218 feat(registry): make the Stack query providers for model listing (#2862)	2025-07-24 10:39:53 -07:00

This flips #2823 and #2805 by making the Stack periodically query the providers for models, rather than having the providers go behind its back and call "register" on the registry themselves. This also adds support for model listing for all other providers via `ModelRegistryHelper`.

Once this is done, we no longer need to manually list or register models via `run.yaml`, which removes both noise and annoyance (setting `INFERENCE_MODEL` environment variables, for example) from the new-user experience.

In addition, it adds a configuration variable, `allowed_models`, which can be used to optionally restrict the set of models exposed by a provider.
advanced_apis	docs: Reorganize documentation on the webpage (#2651)	2025-07-15 14:19:35 -07:00
building_applications	docs: Document use cases for Responses and Agents APIs (#2756)	2025-07-24 12:20:04 -04:00
concepts	docs: update list of apis (#2697)	2025-07-24 09:50:14 -07:00
contributing	chore: create OpenAIMixin for inference providers with an OpenAI-compat API that need to implement openai_* methods (#2835)	2025-07-23 06:49:40 -04:00
deploying	chore: update k8s template (#2786)	2025-07-16 15:07:26 -07:00
distributions	chore: add some documentation for access policy rules (#2785)	2025-07-23 10:27:27 +02:00
getting_started	chore: Adding demo script and importing it into the docs (#2848)	2025-07-21 22:53:32 -04:00
providers	feat(registry): make the Stack query providers for model listing (#2862)	2025-07-24 10:39:53 -07:00
references	docs: update outdated llama stack client documentation (#2758)	2025-07-15 11:49:59 -07:00
conf.py	docs: Reorganize documentation on the webpage (#2651)	2025-07-15 14:19:35 -07:00
index.md	docs: Reorganize documentation on the webpage (#2651)	2025-07-15 14:19:35 -07:00