
remote::runpod

Description

RunPod inference provider for running models on RunPod's cloud GPU platform.

Configuration

| Field | Type | Required | Default | Description |
|-------|------|----------|---------|-------------|
| url | str \| None | No | | The URL for the Runpod model serving endpoint |
| api_token | str \| None | No | | The API token |

Sample Configuration

url: ${env.RUNPOD_URL:+}
api_token: ${env.RUNPOD_API_TOKEN:+}
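
To make the sample concrete, here is a minimal Python sketch of how the two optional fields might be populated from the `RUNPOD_URL` and `RUNPOD_API_TOKEN` environment variables. `RunpodConfigSketch` and `load_runpod_config` are hypothetical names used only for illustration. They are not the llama-stack classes, and the sketch assumes that an unset or empty variable should leave the field as `None`.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass
class RunpodConfigSketch:
    """Hypothetical stand-in mirroring the two documented fields."""

    url: Optional[str] = None        # RunPod model serving endpoint
    api_token: Optional[str] = None  # API token


def load_runpod_config(env: Mapping[str, str] = os.environ) -> RunpodConfigSketch:
    """Build the config from a mapping of environment variables.

    Assumption: unset or empty variables map to None, since both
    fields are optional in the table above.
    """
    return RunpodConfigSketch(
        url=env.get("RUNPOD_URL") or None,
        api_token=env.get("RUNPOD_API_TOKEN") or None,
    )
```

Passing a plain dictionary instead of `os.environ` keeps the function easy to exercise in isolation, for example `load_runpod_config({"RUNPOD_URL": "https://example.invalid/v1"})`, where the URL is a placeholder.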