llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-04 02:03:44 +00:00

History

Ken Dreyer 085126e530 fix: update hard-coded google model names (#4212 ) When we send the model names to Google's openai API, we must use the "google" name prefix. Google does not recognize the "vertexai" model names. Closes #4211 ```bash uv venv --python python312 . .venv/bin/activate llama stack list-deps starter \| xargs -L1 uv pip install llama stack run starter ``` Test that this shows the gemini models with their correct names: ```bash curl http://127.0.0.1:8321/v1/models \| jq '.data \| map(select(.custom_metadata.provider_id == "vertexai"))' ``` Test that this chat completion works: ```bash curl -X POST -H "Content-Type: application/json" "http://127.0.0.1:8321/v1/chat/completions" -d '{ "model": "vertexai/google/gemini-2.5-flash", "messages": [ { "role": "system", "content": "You are a helpful assistant." }, { "role": "user", "content": "Hello! Can you tell me a joke?" } ], "temperature": 1.0, "max_tokens": 256 }' ``` (cherry picked from commit `dabebdd230`) Signed-off-by: Charlie Doern <cdoern@redhat.com>		2025-11-24 14:17:12 -05:00
..
apis	revert: "chore(cleanup)!: remove tool_runtime.rag_tool" (#3877 )	2025-10-21 11:22:06 -07:00
cli	fix: print help for list-deps if no args (backport #4078 ) (#4083 )	2025-11-05 14:58:47 -08:00
core	fix(inference): enable routing of models with provider_data alone (backport #3928 ) (#4142 )	2025-11-12 13:41:27 -08:00
distributions	fix: harden storage semantics (backport #4118 ) (#4138 )	2025-11-12 13:01:21 -08:00
models	chore: remove dead code (#3729 )	2025-10-07 20:26:02 -07:00
providers	fix: update hard-coded google model names (#4212 )	2025-11-24 14:17:12 -05:00
strong_typing	chore: refactor (chat)completions endpoints to use shared params struct (#3761 )	2025-10-10 15:46:34 -07:00
testing	feat(ci): add support for docker:distro in tests (#3832 )	2025-10-16 19:33:13 -07:00
ui	build: Bump version to 0.3.2	2025-11-12 23:19:12 +00:00
__init__.py	chore(rename): move llama_stack.distribution to llama_stack.core (#2975 )	2025-07-30 23:30:53 -07:00
env.py	refactor(test): move tools, evals, datasetio, scoring and post training tests (#1401 )	2025-03-04 14:53:47 -08:00
log.py	fix(logs): restore uvicorn and llama_stack logger settings	2025-10-21 15:47:55 -07:00
schema_utils.py	fix(auth): allow unauthenticated access to health and version endpoints (#3736 )	2025-10-10 13:41:43 -07:00