---
description: "RunPod inference provider for running models on RunPod's cloud GPU platform."
sidebar_label: Remote - Runpod
title: remote::runpod
---

# remote::runpod

## Description

RunPod inference provider for running models on RunPod's cloud GPU platform.

## Configuration

| Field | Type | Required | Default | Description |
|-------|------|----------|---------|-------------|
| `allowed_models` | `list[str] \| None` | No |  | List of models that should be registered with the model registry. If None, all models are allowed. |
| `refresh_models` | `bool` | No | False | Whether to periodically refresh the list of models from the provider. |
| `api_token` | `SecretStr \| None` | No |  | The RunPod API token used to authenticate requests. |
| `base_url` | `HttpUrl \| None` | No |  | The URL of the RunPod model serving endpoint. |

## Sample Configuration

```yaml
base_url: ${env.RUNPOD_URL:=}
api_token: ${env.RUNPOD_API_TOKEN}
```
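
The sample above sets only the connection fields. As a minimal sketch, the optional fields from the configuration table could be combined with them as shown below. The model ID is a hypothetical placeholder, not a model known to be served by any endpoint. The comments assume the usual llama-stack substitution behavior, where `:=` supplies a default (here an empty string) when the environment variable is unset.

```yaml
# Illustrative only: the model ID below is a hypothetical placeholder.
base_url: ${env.RUNPOD_URL:=}        # assumed to fall back to an empty value if RUNPOD_URL is unset
api_token: ${env.RUNPOD_API_TOKEN}   # no default given, so the variable is expected to be set
allowed_models:
  - example-org/example-model        # hypothetical; restricts which models are registered
refresh_models: false                # matches the documented default
```

Leaving `allowed_models` out keeps the documented behavior of allowing all models.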