mirror of
https://github.com/meta-llama/llama-stack.git
synced 2025-07-04 05:12:35 +00:00
# What does this PR do?

Simple approach to get some provider pages in the docs. Add or update description fields in the provider configuration class using Pydantic’s Field, ensuring these descriptions are clear and complete, as they will be used to auto-generate provider documentation via ./scripts/distro_codegen.py instead of editing the docs manually.

Signed-off-by: Sébastien Han <seb@redhat.com>
remote::vllm
Description
Remote vLLM inference provider for connecting to vLLM servers.
Configuration
| Field | Type | Required | Default | Description |
|---|---|---|---|---|
| url | str \| None | No | | The URL for the vLLM model serving endpoint. |
| max_tokens | int | No | 4096 | Maximum number of tokens to generate. |
| api_token | str \| None | No | fake | The API token. |
| tls_verify | bool \| str | No | True | Whether to verify TLS certificates. Can be a boolean or a path to a CA certificate file. |
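Because tls_verify accepts either a boolean or a CA certificate path, a client has to branch on the value's type. The following is a minimal sketch of one way to do that with Python's standard ssl module. The helper name build_ssl_context is hypothetical and this is not taken from the provider's implementation, which may handle the field differently.

```python
import ssl
from typing import Union


def build_ssl_context(tls_verify: Union[bool, str]) -> ssl.SSLContext:
    """Map a tls_verify value to an SSLContext (hypothetical helper)."""
    if tls_verify is True:
        # Verify the server certificate against the system trust store.
        return ssl.create_default_context()
    if tls_verify is False:
        # Disable verification entirely; only suitable for local testing.
        context = ssl.create_default_context()
        context.check_hostname = False  # must be turned off before CERT_NONE
        context.verify_mode = ssl.CERT_NONE
        return context
    # Any other value is treated as a path to a CA certificate file.
    return ssl.create_default_context(cafile=tls_verify)
```

Checking with `is True` and `is False` keeps a string such as a file path from being mistaken for a truthy boolean.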
Sample Configuration
url: ${env.VLLM_URL}
max_tokens: ${env.VLLM_MAX_TOKENS:=4096}
api_token: ${env.VLLM_API_TOKEN:=fake}
tls_verify: ${env.VLLM_TLS_VERIFY:=true}
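The placeholders in the sample read as "use the environment variable, falling back to the value after := when it is unset". The sketch below illustrates that reading in plain Python. The grammar is inferred from the sample alone, the function name resolve_env_placeholders and the example URL are hypothetical, and the actual resolver in the project may differ (for example in how it treats empty values or converts types).

```python
import os
import re

# Matches ${env.NAME} and ${env.NAME:=default}. This grammar is an assumption
# inferred from the sample configuration, not the project's own parser.
_ENV_PATTERN = re.compile(
    r"\$\{env\.(?P<name>[A-Za-z_][A-Za-z0-9_]*)(?::=(?P<default>[^}]*))?\}"
)


def resolve_env_placeholders(value, environ=None):
    """Replace ${env.NAME[:=default]} placeholders in a string."""
    env = os.environ if environ is None else environ

    def _substitute(match):
        name = match.group("name")
        default = match.group("default")
        if name in env:
            # A variable that is set (even to an empty string) wins here.
            return env[name]
        if default is not None:
            return default
        raise KeyError(f"environment variable {name} is not set and has no default")

    return _ENV_PATTERN.sub(_substitute, value)


sample = {
    "url": "${env.VLLM_URL}",
    "max_tokens": "${env.VLLM_MAX_TOKENS:=4096}",
    "api_token": "${env.VLLM_API_TOKEN:=fake}",
    "tls_verify": "${env.VLLM_TLS_VERIFY:=true}",
}

# Hypothetical environment: only VLLM_URL is set, to an example address.
example_env = {"VLLM_URL": "http://localhost:8000/v1"}
resolved = {key: resolve_env_placeholders(text, example_env) for key, text in sample.items()}
```

Note that url has no default in the sample, so under this reading VLLM_URL must be set, while the other three fields fall back to 4096, fake, and true. The resolved values are still strings; converting them to int or bool is left out of this sketch.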