llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-04 02:03:44 +00:00

History

Nathan Weinberg 62b3ad349a fix: return to hardcoded model IDs for Vertex AI (#4041 ) # What does this PR do? partial revert of `b67aef2` Vertex AI doesn't offer an endpoint for listing models from Google's Model Garden Return to hardcoded values until such an endpoint is available Closes #3988 ## Test Plan Server side, set up your Vertex AI env vars (`VERTEX_AI_PROJECT`, `VERTEX_AI_LOCATION`, and `GOOGLE_APPLICATION_CREDENTIALS`) and run the starter distribution ```bash $ llama stack list-deps starter \| xargs -L1 uv pip install $ llama stack run starter ``` Client side, formerly broken cURL requests now working ```bash $ curl http://127.0.0.1:8321/v1/models \| jq '.data \| map(select(.provider_id == "vertexai"))' [ { "identifier": "vertexai/vertex_ai/gemini-2.0-flash", "provider_resource_id": "vertex_ai/gemini-2.0-flash", "provider_id": "vertexai", "type": "model", "metadata": {}, "model_type": "llm" }, { "identifier": "vertexai/vertex_ai/gemini-2.5-flash", "provider_resource_id": "vertex_ai/gemini-2.5-flash", "provider_id": "vertexai", "type": "model", "metadata": {}, "model_type": "llm" }, { "identifier": "vertexai/vertex_ai/gemini-2.5-pro", "provider_resource_id": "vertex_ai/gemini-2.5-pro", "provider_id": "vertexai", "type": "model", "metadata": {}, "model_type": "llm" } ] $ curl -fsS http://127.0.0.1:8321/v1/openai/v1/chat/completions -H "Content-Type: application/json" -d "{\"model\": \"vertexai/vertex_a i/gemini-2.5-flash\", \"messages\": [{\"role\": \"user\", \"content\": \"Hello\"}], \"max_tokens\": 128, \"temperature\": 0.0}" \| jq { "id": "p8oIaYiQF8_PptQPo-GH8QQ", "choices": [ { "finish_reason": "stop", "index": 0, "logprobs": null, "message": { "content": "Hello there! How can I help you today?", "refusal": null, "role": "assistant", "annotations": null, "audio": null, "function_call": null, "tool_calls": null } } ], ... ``` Signed-off-by: Nathan Weinberg <nweinber@redhat.com>		2025-11-03 17:38:16 -08:00
..
agents	chore(package): migrate to src/ layout (#3920 )	2025-10-27 12:02:21 -07:00
datasetio	chore(package): migrate to src/ layout (#3920 )	2025-10-27 12:02:21 -07:00
eval	chore(package): migrate to src/ layout (#3920 )	2025-10-27 12:02:21 -07:00
files	feat: openai files provider (#3946 )	2025-10-28 16:25:03 -07:00
inference	fix: return to hardcoded model IDs for Vertex AI (#4041 )	2025-11-03 17:38:16 -08:00
post_training	chore(package): migrate to src/ layout (#3920 )	2025-10-27 12:02:21 -07:00
safety	chore(package): migrate to src/ layout (#3920 )	2025-10-27 12:02:21 -07:00
tool_runtime	chore(package): migrate to src/ layout (#3920 )	2025-10-27 12:02:21 -07:00
vector_io	chore!: BREAKING CHANGE: vector_db_id -> vector_store_id (#3923 )	2025-10-27 14:26:06 -07:00
__init__.py	chore(package): migrate to src/ layout (#3920 )	2025-10-27 12:02:21 -07:00