llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-03 18:00:36 +00:00

mergify[bot] 2d5ed5d0f5 fix: update hard-coded google model names (backport #4212) (#4229)

# What does this PR do?

When we send the model names to Google's openai API, we must use the "google" name prefix. Google does not recognize the "vertexai" model names.

Closes #4211

## Test Plan

```bash
uv venv --python python312
. .venv/bin/activate
llama stack list-deps starter | xargs -L1 uv pip install
llama stack run starter
```

Test that this shows the gemini models with their correct names:

```bash
curl http://127.0.0.1:8321/v1/models | jq '.data | map(select(.custom_metadata.provider_id == "vertexai"))'
```

Test that this chat completion works:

```bash
curl -X POST -H "Content-Type: application/json" "http://127.0.0.1:8321/v1/chat/completions" -d '{
  "model": "vertexai/google/gemini-2.5-flash",
  "messages": [
    { "role": "system", "content": "You are a helpful assistant." },
    { "role": "user", "content": "Hello! Can you tell me a joke?" }
  ],
  "temperature": 1.0,
  "max_tokens": 256
}'
```

<hr>

This is an automatic backport of pull request #4212 done by [Mergify](https://mergify.com).

Signed-off-by: Charlie Doern <cdoern@redhat.com>
Co-authored-by: Ken Dreyer <kdreyer@redhat.com>

2025-11-24 11:32:14 -08:00
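The model string in the test plan, `vertexai/google/gemini-2.5-flash`, appears to combine the provider ID (`vertexai`) with the "google"-prefixed name that Google recognizes. The sketch below illustrates that reading and rebuilds the chat completion request body from the test plan. Splitting at the first slash is an assumption drawn only from that example string, and `split_model_id` and `build_chat_request` are hypothetical helper names, not llama-stack functions.

```python
import json


def split_model_id(model_id: str) -> tuple[str, str]:
    """Split '<provider_id>/<provider model name>' at the first slash.

    Assumption: the identifier is the provider ID followed by the
    provider-side model name, as the test plan's example suggests.
    """
    provider_id, _, provider_model = model_id.partition("/")
    if not provider_model:
        raise ValueError(f"expected '<provider_id>/<model>', got {model_id!r}")
    return provider_id, provider_model


def build_chat_request(model_id: str, user_text: str) -> dict:
    """Build the same request body that the curl command in the test plan sends."""
    return {
        "model": model_id,
        "messages": [
            {"role": "system", "content": "You are a helpful assistant."},
            {"role": "user", "content": user_text},
        ],
        "temperature": 1.0,
        "max_tokens": 256,
    }


provider_id, provider_model = split_model_id("vertexai/google/gemini-2.5-flash")
payload = build_chat_request(
    "vertexai/google/gemini-2.5-flash", "Hello! Can you tell me a joke?"
)
body = json.dumps(payload)  # JSON string suitable for the POST body
```

Here `provider_model` carries the "google" prefix, while `provider_id` matches the value used in the `jq` filter on `.custom_metadata.provider_id`.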
inline	fix: Vector store persistence across server restarts (backport #3977) (#4225)	2025-11-24 11:30:21 -08:00
registry	revert: "chore(cleanup)!: remove tool_runtime.rag_tool" (#3877)	2025-10-21 11:22:06 -07:00
remote	fix: update hard-coded google model names (backport #4212) (#4229)	2025-11-24 11:32:14 -08:00
utils	fix: enforce allowed_models during inference requests (backport #4197) (#4228)	2025-11-24 11:31:36 -08:00
__init__.py	API Updates (#73)	2024-09-17 19:51:35 -07:00
datatypes.py	chore(cleanup)!: kill vector_db references as far as possible (#3864)	2025-10-20 20:06:16 -07:00