llama-stack-mirror/llama_stack/distribution
Ashwin Bharambe 1463b79218
feat(registry): make the Stack query providers for model listing (#2862)
This flips #2823 and #2805 by making the Stack periodically query the
providers for models rather than the providers going behind the back and
calling "register" on to the registry themselves. This also adds support
for model listing for all other providers via `ModelRegistryHelper`.
Once this is done, we do not need to manually list or register models
via `run.yaml` and it will remove both noise and annoyance (setting
`INFERENCE_MODEL` environment variables, for example) from the new user
experience.

In addition, it adds a configuration variable `allowed_models` which can
be used to optionally restrict the set of models exposed from a
provider.
2025-07-24 10:39:53 -07:00
..
access_control fix: auth sql store: user is owner policy (#2674) 2025-07-10 14:40:32 -07:00
routers feat(registry): more flexible model lookup (#2859) 2025-07-22 15:22:48 -07:00
routing_tables feat(registry): make the Stack query providers for model listing (#2862) 2025-07-24 10:39:53 -07:00
server fix: honour deprecation of --config and --template (#2856) 2025-07-22 20:48:23 -07:00
store fix: store configs (#2593) 2025-07-03 10:07:23 -07:00
ui chore: remove nested imports (#2515) 2025-06-26 08:01:05 +05:30
utils chore: merge --config and --template in server.py (#2716) 2025-07-21 13:19:27 -07:00
__init__.py API Updates (#73) 2024-09-17 19:51:35 -07:00
build.py chore: bump python supported version to 3.12 (#2475) 2025-06-24 09:22:04 +05:30
build_conda_env.sh chore: fix build script bug (#2507) 2025-06-24 12:05:22 -07:00
build_container.sh fix: prevent shell redirection issues with pip dependencies (#2867) 2025-07-23 21:43:33 +02:00
build_venv.sh chore: remove straggler references to llama-models (#1345) 2025-03-01 14:26:03 -08:00
client.py chore: make cprint write to stderr (#2250) 2025-05-24 23:39:57 -07:00
common.sh feat(pre-commit): enhance pre-commit hooks with additional checks (#2014) 2025-04-30 11:35:49 -07:00
configure.py fix: stop image_name from being cast to an integer (#2759) 2025-07-15 09:44:21 -07:00
datatypes.py feat(registry): make the Stack query providers for model listing (#2862) 2025-07-24 10:39:53 -07:00
distribution.py ci: fix external provider test (#2438) 2025-06-12 16:14:32 +02:00
inspect.py chore: add mypy coverage to inspect.py and library_client.py in /distribution (#2707) 2025-07-24 09:51:46 -07:00
library_client.py feat(registry): make the Stack query providers for model listing (#2862) 2025-07-24 10:39:53 -07:00
providers.py feat: consolidate most distros into "starter" (#2516) 2025-07-04 15:58:03 +02:00
request_headers.py feat: fine grained access control policy (#2264) 2025-06-03 14:51:12 -07:00
resolver.py fix: de-clutter llama stack run logs (#2783) 2025-07-16 09:44:26 -07:00
stack.py feat(registry): make the Stack query providers for model listing (#2862) 2025-07-24 10:39:53 -07:00
start_stack.sh chore: merge --config and --template in server.py (#2716) 2025-07-21 13:19:27 -07:00