mirror of https://github.com/meta-llama/llama-stack.git synced 2025-07-04 21:25:23 +00:00

docs: auto generated documentation for providers (#2543 )

# What does this PR do?

Simple approach to get some provider pages in the docs.

Add or update description fields in the provider configuration class
using Pydantic’s Field, ensuring these descriptions are clear and
complete, as they will be used to auto-generate provider documentation
via ./scripts/distro_codegen.py instead of editing the docs manually.

Signed-off-by: Sébastien Han <seb@redhat.com>

2025-06-30 15:13:20 +02:00

763 B

Raw Blame History

remote::vllm

Description

Remote vLLM inference provider for connecting to vLLM servers.

Configuration

Field	Type	Required	Default	Description
`url`	`str \| None`	No		The URL for the vLLM model serving endpoint
`max_tokens`	`<class 'int'>`	No	4096	Maximum number of tokens to generate.
`api_token`	`str \| None`	No	fake	The API token
`tls_verify`	`bool \| str`	No	True	Whether to verify TLS certificates. Can be a boolean or a path to a CA certificate file.

Sample Configuration

url: ${env.VLLM_URL}
max_tokens: ${env.VLLM_MAX_TOKENS:=4096}
api_token: ${env.VLLM_API_TOKEN:=fake}
tls_verify: ${env.VLLM_TLS_VERIFY:=true}

763 B Raw Blame History

remote::vllm

Description

Configuration

Sample Configuration

763 B

Raw Blame History