Fix issue when generating vLLM distros

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>
2026-01-07 04:39:59 +00:00 · 2025-01-13 18:43:23 -05:00 · 2025-01-13 18:43:23 -05:00 · 7c726826b8
commit 7c726826b8
parent 89e3f81520
3 changed files with 14 additions and 46 deletions
--- a/llama_stack/templates/remote-vllm/vllm.py
+++ b/llama_stack/templates/remote-vllm/vllm.py
@ -134,7 +134,7 @@ def get_distribution_template() -> DistributionTemplate:
                "Inference model loaded into the vLLM server",
            ),
            "VLLM_URL": (
-                "http://host.docker.internal:5100}/v1",
+                "http://host.docker.internal:5100/v1",
                "URL of the vLLM server with the main inference model",
            ),
            "MAX_TOKENS": (