rename LLAMASTACK_PORT to LLAMA_STACK_PORT for consistency with other env vars

Raghotham Murthy 2025-01-10 09:00:22 -08:00
parent 027a46ddd7
commit 36dcf00653
25 changed files with 25 additions and 25 deletions

@@ -27,7 +27,7 @@ You can use this distribution if you have GPUs and want to run an independent vL
 The following environment variables can be configured:
-- `LLAMASTACK_PORT`: Port for the Llama Stack distribution server (default: `5001`)
+- `LLAMA_STACK_PORT`: Port for the Llama Stack distribution server (default: `5001`)
 - `INFERENCE_MODEL`: Inference model loaded into the vLLM server (default: `meta-llama/Llama-3.2-3B-Instruct`)
 - `VLLM_URL`: URL of the vLLM server with the main inference model (default: `http://host.docker.internal:5100/v1`)
 - `MAX_TOKENS`: Maximum number of tokens for generation (default: `4096`)
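
For context, here is a minimal sketch of how these variables might be wired into a server launch after the rename. The image name `llamastack/distribution-remote-vllm` and the `--port`/`--env` flags are assumptions based on typical Llama Stack distribution usage, not something this commit specifies.

```bash
# Hypothetical launch sketch -- image name and CLI flags are assumptions,
# not taken from this commit.
export LLAMA_STACK_PORT=5001   # renamed from LLAMASTACK_PORT in this commit
export INFERENCE_MODEL=meta-llama/Llama-3.2-3B-Instruct
export VLLM_URL=http://host.docker.internal:5100/v1
export MAX_TOKENS=4096

docker run -it \
  -p $LLAMA_STACK_PORT:$LLAMA_STACK_PORT \
  llamastack/distribution-remote-vllm \
  --port $LLAMA_STACK_PORT \
  --env INFERENCE_MODEL=$INFERENCE_MODEL \
  --env VLLM_URL=$VLLM_URL \
  --env MAX_TOKENS=$MAX_TOKENS
```

Note that any script or compose file still exporting the old `LLAMASTACK_PORT` name would silently fall back to the default port after this change.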