llama-stack-mirror/docs/source/providers/inference/remote_vllm.md
Sébastien Han c9a49a80e8
docs: auto generated documentation for providers (#2543)
# What does this PR do?

Simple approach to get some provider pages in the docs.

Add or update description fields in the provider configuration class
using Pydantic’s Field, ensuring these descriptions are clear and
complete, as they will be used to auto-generate provider documentation
via ./scripts/distro_codegen.py instead of editing the docs manually.

Signed-off-by: Sébastien Han <seb@redhat.com>
2025-06-30 15:13:20 +02:00

763 B

remote::vllm

Description

Remote vLLM inference provider for connecting to vLLM servers.

Configuration

Field Type Required Default Description
url str | None No The URL for the vLLM model serving endpoint
max_tokens <class 'int'> No 4096 Maximum number of tokens to generate.
api_token str | None No fake The API token
tls_verify bool | str No True Whether to verify TLS certificates. Can be a boolean or a path to a CA certificate file.

Sample Configuration

url: ${env.VLLM_URL}
max_tokens: ${env.VLLM_MAX_TOKENS:=4096}
api_token: ${env.VLLM_API_TOKEN:=fake}
tls_verify: ${env.VLLM_TLS_VERIFY:=true}