mirror of
https://github.com/meta-llama/llama-stack.git
synced 2025-12-03 09:53:45 +00:00
# What does this PR do?

Completes #3732 by removing runtime URL transformations and requiring users to provide full URLs in configuration. All providers now use `base_url` consistently and respect the exact URL provided, without appending paths such as `/v1` or `/openai/v1` at runtime.

BREAKING CHANGE: Users must update configs to include full URL paths (e.g., `http://localhost:11434/v1` instead of `http://localhost:11434`).

Closes #3732

## Test Plan

Existing tests should pass even with the URL changes, because the default URLs were altered accordingly. Add a unit test to enforce URL standardization across remote inference providers (it verifies that all use a `base_url` field of type `HttpUrl | None`).

Signed-off-by: Charlie Doern <cdoern@redhat.com>
---
description: "IBM WatsonX inference provider for accessing AI models on IBM's WatsonX platform."
sidebar_label: Remote - Watsonx
title: remote::watsonx
---

# remote::watsonx

## Description

IBM WatsonX inference provider for accessing AI models on IBM's WatsonX platform.

## Configuration

| Field | Type | Required | Default | Description |
|-------|------|----------|---------|-------------|
| `allowed_models` | `list[str] \| None` | No | | List of models that should be registered with the model registry. If None, all models are allowed. |
| `refresh_models` | `bool` | No | False | Whether to refresh models periodically from the provider. |
| `api_key` | `SecretStr \| None` | No | | Authentication credential for the provider. |
| `base_url` | `HttpUrl \| None` | No | https://us-south.ml.cloud.ibm.com | Base URL for accessing watsonx.ai. |
| `project_id` | `str \| None` | No | | The watsonx.ai project ID. |
| `timeout` | `int` | No | 60 | Timeout for HTTP requests. |
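
The table can be read as a typed settings object. The following is a minimal sketch using only the Python standard library, not the provider's actual config class: the name `WatsonXConfigSketch` is hypothetical, plain `str` stands in for `SecretStr` and `HttpUrl`, and the URL check is only a rough approximation of what an `HttpUrl` type would enforce. The custom URL in the usage lines is an illustrative placeholder.

```python
from dataclasses import dataclass
from typing import Optional
from urllib.parse import urlparse


@dataclass
class WatsonXConfigSketch:
    """Hypothetical stand-in mirroring the fields and defaults in the table."""

    allowed_models: Optional[list[str]] = None  # None means all models are allowed
    refresh_models: bool = False
    api_key: Optional[str] = None  # SecretStr in the table; plain str in this sketch
    base_url: Optional[str] = "https://us-south.ml.cloud.ibm.com"
    project_id: Optional[str] = None
    timeout: int = 60

    def __post_init__(self) -> None:
        # Rough approximation of URL validation: require http(s) and a host.
        if self.base_url is not None:
            parsed = urlparse(self.base_url)
            if parsed.scheme not in ("http", "https") or not parsed.netloc:
                raise ValueError(f"base_url is not an HTTP(S) URL: {self.base_url!r}")


# Defaults match the table; overrides are passed as keyword arguments.
default_config = WatsonXConfigSketch()
custom_config = WatsonXConfigSketch(base_url="https://watsonx.example.com", timeout=120)
```

Because the URL is used exactly as given, a check like this only confirms that the value is well formed; it does not add or strip any path.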

## Sample Configuration

```yaml
base_url: ${env.WATSONX_BASE_URL:=https://us-south.ml.cloud.ibm.com}
api_key: ${env.WATSONX_API_KEY:=}
project_id: ${env.WATSONX_PROJECT_ID:=}
```
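
The `${env.NAME:=default}` placeholders in the sample read as "use the environment variable if it is set, otherwise the default after `:=`". The sketch below illustrates that reading with a hypothetical helper, `resolve_env_placeholders`; it is not the stack's own resolver. It assumes that only an absent variable falls back to the default, and that an empty default resolves to an empty string; the real behavior for empty values may differ.

```python
import os
import re

# Matches ${env.NAME:=default}; the default part may be empty.
_PLACEHOLDER = re.compile(r"\$\{env\.([A-Za-z_][A-Za-z0-9_]*):=([^}]*)\}")


def resolve_env_placeholders(value: str, environ=None) -> str:
    """Hypothetical helper: substitute each placeholder with the variable's
    value, or with its default when the variable is absent."""
    env = os.environ if environ is None else environ

    def _substitute(match: re.Match) -> str:
        name, default = match.group(1), match.group(2)
        return env.get(name, default)

    return _PLACEHOLDER.sub(_substitute, value)


sample = {
    "base_url": "${env.WATSONX_BASE_URL:=https://us-south.ml.cloud.ibm.com}",
    "api_key": "${env.WATSONX_API_KEY:=}",
    "project_id": "${env.WATSONX_PROJECT_ID:=}",
}

# Illustrative environment: only the API key is set.
fake_env = {"WATSONX_API_KEY": "example-key"}
resolved = {key: resolve_env_placeholders(raw, fake_env) for key, raw in sample.items()}
```

With this environment, `base_url` keeps its default, `api_key` takes the value from the environment, and `project_id` resolves to an empty string.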