# What does this PR do? Simple approach to get some provider pages in the docs. Add or update description fields in the provider configuration class using Pydantic’s Field, ensuring these descriptions are clear and complete, as they will be used to auto-generate provider documentation via ./scripts/distro_codegen.py instead of editing the docs manually. Signed-off-by: Sébastien Han <seb@redhat.com>
# remote::vllm

## Description

Remote vLLM inference provider for connecting to vLLM servers.

## Configuration

| Field | Type | Required | Default | Description |
|-------|------|----------|---------|-------------|
| `url` | `str \| None` | No | | The URL for the vLLM model serving endpoint. |
| `max_tokens` | `int` | No | 4096 | Maximum number of tokens to generate. |
| `api_token` | `str \| None` | No | fake | The API token. |
| `tls_verify` | `bool \| str` | No | True | Whether to verify TLS certificates. Can be a boolean or a path to a CA certificate file. |

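To make the defaults in the table concrete, here is a minimal sketch using a standard-library dataclass. The names `VLLMConfigSketch` and `describe_tls` are hypothetical and chosen for illustration only; this is not the provider's actual configuration class, and the way `tls_verify` is interpreted here is an assumption based on the field description.

```python
from dataclasses import dataclass
from typing import Optional, Union


@dataclass
class VLLMConfigSketch:
    """Hypothetical stand-in that mirrors the fields and defaults in the table."""

    url: Optional[str] = None             # vLLM model serving endpoint
    max_tokens: int = 4096                # maximum number of tokens to generate
    api_token: Optional[str] = "fake"     # placeholder default from the table
    tls_verify: Union[bool, str] = True   # bool, or a path to a CA certificate file


def describe_tls(tls_verify: Union[bool, str]) -> str:
    """Explain how a tls_verify value would be read (assumed semantics)."""
    if isinstance(tls_verify, str):
        return f"verify using CA certificate file {tls_verify}"
    return "verify using default CAs" if tls_verify else "verification disabled"
```

With no arguments, `VLLMConfigSketch()` carries the table's defaults, so only `url` usually needs to be supplied.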
## Sample Configuration

```yaml
url: ${env.VLLM_URL}
max_tokens: ${env.VLLM_MAX_TOKENS:=4096}
api_token: ${env.VLLM_API_TOKEN:=fake}
tls_verify: ${env.VLLM_TLS_VERIFY:=true}
```
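The sample configuration uses `${env.NAME}` and `${env.NAME:=default}` placeholders. The sketch below shows one way such placeholders could be expanded, assuming `:=` supplies a fallback when the variable is unset and that a placeholder without a fallback is an error when the variable is missing. The function `resolve_env` and its regular expression are hypothetical and are not the project's own resolver.

```python
import os
import re
from typing import Mapping, Optional

# Matches ${env.NAME} and ${env.NAME:=default}; group 2 is None when no default is given.
_ENV_REF = re.compile(r"\$\{env\.([A-Za-z_][A-Za-z0-9_]*)(?::=([^}]*))?\}")


def resolve_env(value: str, env: Optional[Mapping[str, str]] = None) -> str:
    """Expand env placeholders in a string (assumed semantics, illustration only)."""
    env = os.environ if env is None else env

    def _substitute(match: "re.Match[str]") -> str:
        name, default = match.group(1), match.group(2)
        if name in env:
            return env[name]
        if default is not None:
            return default
        raise KeyError(f"environment variable {name} is not set and has no default")

    return _ENV_REF.sub(_substitute, value)
```

Under these assumptions, `resolve_env("${env.VLLM_MAX_TOKENS:=4096}", {})` yields the string `"4096"`; converting that string to an `int` or `bool` would be a separate step.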