llama-stack-mirror/llama_stack
Kai Wu e3fd70c321
fix: change ModelRegistryHelper to use ProviderModelEntry instead of hardcoded ModelType.llm (#3451)
# What does this PR do?
Change `ModelRegistryHelper` to use `ProviderModelEntry` instead of the hardcoded `ModelType.llm`, so that embedding models are registered with the correct model type. This fixes issue #3330.
Closes #3330
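
For context, here is a minimal sketch of the idea behind the fix. The class and field names below are simplified stand-ins, not the exact llama_stack source; the point is that the registry helper takes the model type from each `ProviderModelEntry` rather than assuming `ModelType.llm` for every model.
```
# Illustrative sketch only -- simplified stand-ins for the real classes.
from dataclasses import dataclass, field
from enum import Enum


class ModelType(str, Enum):
    llm = "llm"
    embedding = "embedding"


@dataclass
class ProviderModelEntry:
    provider_model_id: str
    model_type: ModelType = ModelType.llm  # each entry declares its own type
    metadata: dict = field(default_factory=dict)


class ModelRegistryHelper:
    def __init__(self, model_entries: list[ProviderModelEntry]):
        self._entries = {e.provider_model_id: e for e in model_entries}

    def build_model(self, provider_model_id: str) -> dict:
        entry = self._entries[provider_model_id]
        return {
            "identifier": provider_model_id,
            # Previously hardcoded to ModelType.llm; now taken from the entry,
            # so embedding models report model_type == "embedding".
            "model_type": entry.model_type,
            "metadata": entry.metadata,
        }
```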

## Test Plan
1. Start the llama-stack server:
```
uv sync --python 3.12
source .venv/bin/activate
uv run llama stack build --distro starter --image-type venv  --run
```
2. Use the following script to test:
```
import os

from llama_stack_client import LlamaStackClient


def test_openai_embedding_type():
    client = LlamaStackClient(
        base_url=os.environ.get("LLAMA_STACK_ENDPOINT", "http://localhost:8321"),
        provider_data={
            "openai_api_key": os.environ.get("OPENAI_API_KEY", ""),
        },
    )
    model = client.models.retrieve("openai/text-embedding-3-small")
    print(model)
    assert model.identifier == "openai/text-embedding-3-small"
    # Before the fix this was reported as "llm"; it should now be "embedding".
    assert model.model_type == "embedding"


test_openai_embedding_type()
```
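To re-run this, save the script as `test_openai.py` (the filename used in the log below), export `OPENAI_API_KEY`, and run it with `python test_openai.py` against the server started in step 1.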
Logs:
```
python test_openai.py
INFO:httpx:HTTP Request: GET http://localhost:8321/v1/models/openai/text-embedding-3-small "HTTP/1.1 200 OK"
Model(identifier='openai/text-embedding-3-small', metadata={'embedding_dimension': 1536.0, 'context_length': 8192.0}, api_model_type='embedding', provider_id='openai', type='model', provider_resource_id='text-embedding-3-small', owner=None, source='listed_from_provider', model_type='embedding')
```
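The same check can also be made directly against the endpoint shown in the log, assuming the server from step 1 is still running on the default port; the returned JSON should report `model_type` as `embedding` rather than `llm`.
```
curl http://localhost:8321/v1/models/openai/text-embedding-3-small
```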
2025-09-22 12:55:32 -04:00
| Path | Last commit | Date |
|------|-------------|------|
| `apis` | feat: create HTTP DELETE API endpoints to unregister ScoringFn and Benchmark resources in Llama Stack (#3371) | 2025-09-15 12:43:38 -07:00 |
| `cli` | feat: migrate to FIPS-validated cryptographic algorithms (#3423) | 2025-09-12 11:18:19 +02:00 |
| `core` | fix: handle missing API keys gracefully in model refresh (#3493) | 2025-09-22 07:31:30 -04:00 |
| `distributions` | feat: combine ProviderSpec datatypes (#3378) | 2025-09-18 16:10:00 +02:00 |
| `models` | refactor(logging): rename llama_stack logger categories (#3065) | 2025-08-21 17:31:04 -07:00 |
| `providers` | fix: change ModelRegistryHelper to use ProviderModelEntry instead of hardcoded ModelType.llm (#3451) | 2025-09-22 12:55:32 -04:00 |
| `strong_typing` | chore: enable pyupgrade fixes (#1806) | 2025-05-01 14:23:50 -07:00 |
| `testing` | fix: Update inference recorder to handle both Ollama and OpenAI model (#3470) | 2025-09-21 09:32:39 -04:00 |
| `ui` | chore(ui-deps): bump jest-environment-jsdom from 29.7.0 to 30.1.2 in /llama_stack/ui (#3509) | 2025-09-22 13:57:10 +02:00 |
| `__init__.py` | chore(rename): move llama_stack.distribution to llama_stack.core (#2975) | 2025-07-30 23:30:53 -07:00 |
| `env.py` | refactor(test): move tools, evals, datasetio, scoring and post training tests (#1401) | 2025-03-04 14:53:47 -08:00 |
| `log.py` | chore(pre-commit): add pre-commit hook to enforce llama_stack logger usage (#3061) | 2025-08-20 07:15:35 -04:00 |
| `schema_utils.py` | feat(auth): API access control (#2822) | 2025-07-24 15:30:48 -07:00 |