An error has been cropping up where we can retrieve a model (for example when doing a chat completion) but then hit issues when trying to associate that model with an active provider.
This commonly happens when:
1. you run the stack with, say, `remote::ollama`
2. you register a model, say `llama3.2:3b`
3. you run some completions, etc.
4. you kill the server
5. you `unset OLLAMA_URL`
6. you restart the stack
7. you run `llama-stack-client models list` (an equivalent Python SDK check is sketched after the output below):
```
├───────────────┼──────────────────────────────────────────────────────────────────────────────────┼──────────────────────────────────────────────────────────────────────┼───────────────────────────────────────┼──────────────────────────┤
│ embedding │ all-minilm │ all-minilm:l6-v2 │ {'embedding_dimension': 384.0, │ ollama │
│ │ │ │ 'context_length': 512.0} │ │
├───────────────┼──────────────────────────────────────────────────────────────────────────────────┼──────────────────────────────────────────────────────────────────────┼───────────────────────────────────────┼──────────────────────────┤
│ llm │ llama3.2:3b │ llama3.2:3b │ │ ollama │
├───────────────┼──────────────────────────────────────────────────────────────────────────────────┼──────────────────────────────────────────────────────────────────────┼───────────────────────────────────────┼──────────────────────────┤
│ embedding │ ollama/all-minilm:l6-v2 │ all-minilm:l6-v2 │ {'embedding_dimension': 384.0, │ ollama │
│ │ │ │ 'context_length': 512.0} │ │
├───────────────┼──────────────────────────────────────────────────────────────────────────────────┼──────────────────────────────────────────────────────────────────────┼───────────────────────────────────────┼──────────────────────────┤
│ llm │ ollama/llama3.2:3b │ llama3.2:3b │ │ ollama │
├───────────────┼──────────────────────────────────────────────────────────────────────────────────┼──────────────────────────────────────────────────────────────────────┼───────────────────────────────────────┼──────────────────────────┤
```
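For reference, the same check can be done through the Python SDK. This is a hedged sketch, assuming the `llama_stack_client` package with a `LlamaStackClient` exposing `models.list()`, and a stack listening on a default local port; the base URL and exact attribute names are assumptions, not taken from this PR:

```python
# Hedged sketch: assumes the llama_stack_client Python SDK with LlamaStackClient
# and models.list(); the base URL/port below are assumptions.
from llama_stack_client import LlamaStackClient

client = LlamaStackClient(base_url="http://localhost:8321")

# After restarting the stack with OLLAMA_URL unset, no ollama-backed models
# should show up here, even though they are still persisted on disk.
for model in client.models.list():
    print(model.model_type, model.identifier, model.provider_id)
```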
This shouldn't be happening: `ollama` isn't a running provider, and the only reason the model shows up is that it's in the dist_registry (on disk).
While it's nice to have this static store, so that if I `export OLLAMA_URL=..` again the stack can read from it, it shouldn't _always_ be reading and returning these models from the store.
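The intended behavior can be sketched roughly as follows. This is a minimal illustration of the idea, not the actual llama-stack implementation; the `Model` dataclass and `active_models` helper are hypothetical names:

```python
# Hypothetical sketch of the idea behind this change, not the real implementation:
# only return registry entries whose provider is currently configured/active.
from dataclasses import dataclass


@dataclass
class Model:
    identifier: str
    provider_id: str


def active_models(registered: list[Model], active_provider_ids: set[str]) -> list[Model]:
    """Return only the models backed by a provider that is actually running."""
    return [m for m in registered if m.provider_id in active_provider_ids]


# With OLLAMA_URL unset, `ollama` is no longer an active provider, so its
# entries persisted in the dist_registry are filtered out of `models list`.
registered = [Model("llama3.2:3b", "ollama"), Model("all-minilm:l6-v2", "ollama")]
print(active_models(registered, active_provider_ids=set()))  # -> []
```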
Now, with this change, if you run `llama-stack-client models list`, `llama3.2:3b` no longer appears.
Signed-off-by: Charlie Doern <cdoern@redhat.com>