llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-07-27 14:38:49 +00:00

Ashwin Bharambe 1463b79218 feat(registry): make the Stack query providers for model listing (#2862)	2025-07-24 10:39:53 -07:00

This flips #2823 and #2805 by making the Stack periodically query the providers for models, rather than having the providers go behind its back and call "register" on the registry themselves. It also adds support for model listing for all other providers via `ModelRegistryHelper`. Once this is done, models no longer need to be listed or registered manually via `run.yaml`, which removes both noise and annoyance (setting `INFERENCE_MODEL` environment variables, for example) from the new-user experience. In addition, it adds a configuration variable, `allowed_models`, which can optionally restrict the set of models exposed from a provider.
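
The flow described in this commit message (the Stack pulls each provider's model list and optionally filters it through `allowed_models`) can be sketched roughly as follows. This is an illustration only, not the code from `model_registry.py`: `ListedModel`, `filter_allowed`, and `refresh_registry` are hypothetical names, and the registry is assumed to be a plain dict keyed by provider id.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ListedModel:
    """Hypothetical record for one model reported by a provider."""
    provider_id: str
    model_id: str


def filter_allowed(models, allowed_models=None):
    """Keep only models named in allowed_models; None means no restriction."""
    if allowed_models is None:
        return list(models)
    allowed = set(allowed_models)
    return [m for m in models if m.model_id in allowed]


def refresh_registry(registry, provider_id, listed_ids, allowed_models=None):
    """Replace one provider's registry entries with its current listing.

    Assumption: the Stack calls something like this on a timer for each
    provider, instead of providers pushing registrations themselves.
    """
    listed = [ListedModel(provider_id, model_id) for model_id in listed_ids]
    registry[provider_id] = filter_allowed(listed, allowed_models)
    return registry
```

Calling `refresh_registry` again with a new listing overwrites that provider's earlier entries, so models the provider no longer reports drop out on the next refresh.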
__init__.py	chore: enable pyupgrade fixes (#1806)	2025-05-01 14:23:50 -07:00
embedding_mixin.py	feat(registry): more flexible model lookup (#2859)	2025-07-22 15:22:48 -07:00
inference_store.py	feat: support auth attributes in inference/responses stores (#2389)	2025-06-20 10:24:45 -07:00
litellm_openai_mixin.py	feat: create dynamic model registration for OpenAI and Llama compat remote inference providers (#2745)	2025-07-16 12:49:38 -04:00
model_registry.py	feat(registry): make the Stack query providers for model listing (#2862)	2025-07-24 10:39:53 -07:00
openai_compat.py	chore: remove nested imports (#2515)	2025-06-26 08:01:05 +05:30
openai_mixin.py	chore: create OpenAIMixin for inference providers with an OpenAI-compat API that need to implement openai_* methods (#2835)	2025-07-23 06:49:40 -04:00
prompt_adapter.py	fix(ollama): Download remote image URLs for Ollama (#2551)	2025-06-30 20:36:11 +05:30
stream_utils.py	feat: drop python 3.10 support (#2469)	2025-06-19 12:07:14 +05:30