llama-stack-mirror/docs/docs/providers/inference/remote_vertexai.mdx

---
description: |
  Google Vertex AI inference provider enables you to use Google's Gemini models through Google Cloud's Vertex AI platform, providing several advantages:

  • Enterprise-grade security: Uses Google Cloud's security controls and IAM
  • Better integration: Seamless integration with other Google Cloud services
  • Advanced features: Access to additional Vertex AI features like model tuning and monitoring
  • Authentication: Uses Google Cloud Application Default Credentials (ADC) instead of API keys

  Configuration:
  - Set VERTEX_AI_PROJECT environment variable (required)
  - Set VERTEX_AI_LOCATION environment variable (optional, defaults to us-central1)
  - Use Google Cloud Application Default Credentials or service account key

  Authentication Setup:
  Option 1 (Recommended): gcloud auth application-default login
  Option 2: Set GOOGLE_APPLICATION_CREDENTIALS to service account key path

  Available Models:
  - vertex_ai/gemini-2.0-flash
  - vertex_ai/gemini-2.5-flash
  - vertex_ai/gemini-2.5-pro
sidebar_label: Remote - Vertexai
title: remote::vertexai
---

# remote::vertexai

## Description

Google Vertex AI inference provider enables you to use Google's Gemini models through Google Cloud's Vertex AI platform, providing several advantages:

• Enterprise-grade security: Uses Google Cloud's security controls and IAM
• Better integration: Seamless integration with other Google Cloud services
• Advanced features: Access to additional Vertex AI features like model tuning and monitoring
• Authentication: Uses Google Cloud Application Default Credentials (ADC) instead of API keys

Configuration:
- Set VERTEX_AI_PROJECT environment variable (required)
- Set VERTEX_AI_LOCATION environment variable (optional, defaults to us-central1)
- Use Google Cloud Application Default Credentials or service account key

Authentication Setup:
Option 1 (Recommended): gcloud auth application-default login
Option 2: Set GOOGLE_APPLICATION_CREDENTIALS to service account key path

Available Models:
- vertex_ai/gemini-2.0-flash
- vertex_ai/gemini-2.5-flash
- vertex_ai/gemini-2.5-pro

## Configuration

| Field | Type | Required | Default | Description |
|-------|------|----------|---------|-------------|
| `allowed_models` | `list[str \| None` | No |  | List of models that should be registered with the model registry. If None, all models are allowed. |
| `refresh_models` | `<class 'bool'>` | No | False | Whether to refresh models periodically from the provider |
| `project` | `<class 'str'>` | No |  | Google Cloud project ID for Vertex AI |
| `location` | `<class 'str'>` | No | us-central1 | Google Cloud location for Vertex AI |

## Sample Configuration

```yaml
project: ${env.VERTEX_AI_PROJECT:=}
location: ${env.VERTEX_AI_LOCATION:=us-central1}
```