Fix issue when generating distros (#755)
Addresses the review comment at https://github.com/meta-llama/llama-stack/pull/723#issuecomment-2581902075.

cc @yanxi0830: I am not 100% sure the diff is correct, but it is the result of running `python llama_stack/scripts/distro_codegen.py`.

---------

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>
parent 52a21ce78f
commit 300e6e2702
3 changed files with 133 additions and 143 deletions
@@ -134,7 +134,7 @@ def get_distribution_template() -> DistributionTemplate:
                 "Inference model loaded into the vLLM server",
             ),
             "VLLM_URL": (
-                "http://host.docker.internal:5100}/v1",
+                "http://host.docker.internal:5100/v1",
                 "URL of the vLLM server with the main inference model",
             ),
             "MAX_TOKENS": (
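
The hunk above removes a stray `}` from the default value of `VLLM_URL`. A minimal sketch of how such a typo could be caught before regenerating distros, assuming the defaults are available as a plain dict of `(default, description)` tuples as in the hunk. The helper `is_well_formed_url` and the `env_defaults` dict are hypothetical names for illustration and are not part of llama-stack.

```python
from urllib.parse import urlsplit


def is_well_formed_url(url: str) -> bool:
    """Return True if url has an http(s) scheme, a host, and a numeric or absent port."""
    try:
        parts = urlsplit(url)
        if parts.scheme not in ("http", "https") or not parts.hostname:
            return False
        # Accessing .port raises ValueError when the port is not numeric,
        # which is what happens with a port such as "5100}".
        parts.port
    except ValueError:
        return False
    return True


# Hypothetical stand-in for the template's env var table: name -> (default, description).
env_defaults = {
    "VLLM_URL": (
        "http://host.docker.internal:5100/v1",
        "URL of the vLLM server with the main inference model",
    ),
}

# Collect every *_URL entry whose default value does not parse cleanly.
bad_defaults = {
    name: default
    for name, (default, _description) in env_defaults.items()
    if name.endswith("_URL") and not is_well_formed_url(default)
}
```

With the corrected default, `bad_defaults` is empty; the pre-fix value `http://host.docker.internal:5100}/v1` would be reported because its port does not parse as an integer.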