mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-27 06:48:05 +00:00

Francisco Javier Arceo c8d41d45ec chore: Enabling Milvus for VectorIO CI

Signed-off-by: Francisco Javier Arceo <farceo@redhat.com>

2025-06-30 11:55:49 -04:00

remote::runpod

Description

RunPod inference provider for running models on RunPod's cloud GPU platform.

| Field | Type | Required | Default | Description |
|-------|------|----------|---------|-------------|
| `url` | `str \| None` | No |  | The URL for the RunPod model serving endpoint |
| `api_token` | `str \| None` | No |  | The RunPod API token |

url: ${env.RUNPOD_URL:+}
api_token: ${env.RUNPOD_API_TOKEN:+}
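
The two optional fields and their environment variables can be illustrated with a small Python sketch. This is not the provider's actual config class: `RunpodConfigSketch` and `load_runpod_config` are hypothetical names, and the rule that an unset or empty variable becomes `None` is an assumption made for the example, not a statement about how `${env...:+}` is resolved.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass
class RunpodConfigSketch:
    """Hypothetical stand-in mirroring the two documented fields."""

    url: Optional[str] = None        # RunPod model serving endpoint
    api_token: Optional[str] = None  # RunPod API token


def load_runpod_config(env: Mapping[str, str] = os.environ) -> RunpodConfigSketch:
    """Build the sketch config from RUNPOD_URL and RUNPOD_API_TOKEN.

    Assumption: an unset or empty variable is treated as "not provided"
    and stored as None, since both fields are optional.
    """
    return RunpodConfigSketch(
        url=env.get("RUNPOD_URL") or None,
        api_token=env.get("RUNPOD_API_TOKEN") or None,
    )
```

Passing an explicit mapping instead of relying on `os.environ` keeps the loader easy to exercise without touching the real environment, for example `load_runpod_config({"RUNPOD_URL": "https://example.invalid/v1"})`.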