# remote::cerebras

## Description

Cerebras inference provider for running models on the Cerebras Cloud platform.

## Configuration

| Field | Type | Required | Default | Description |
|-------|------|----------|---------|-------------|
| `base_url` | `<class 'str'>` | No | https://api.cerebras.ai | Base URL for the Cerebras API |
| `api_key` | `<class 'pydantic.types.SecretStr'>` | No |  | Cerebras API Key |
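
The two fields above can be pictured as a small config object. The following is a minimal sketch using only the Python standard library; `CerebrasConfigSketch` and `from_mapping` are hypothetical names chosen for illustration and are not the provider's actual config class (which, per the table, uses pydantic's `SecretStr` for the key). Treating an empty `api_key` as "no key" is also an assumption.

```python
from dataclasses import dataclass, field
from typing import Mapping, Optional

# Default taken from the configuration table.
DEFAULT_BASE_URL = "https://api.cerebras.ai"


@dataclass
class CerebrasConfigSketch:
    """Hypothetical stand-in that mirrors the documented fields."""

    base_url: str = DEFAULT_BASE_URL
    # repr=False keeps the key out of printed or logged representations.
    api_key: Optional[str] = field(default=None, repr=False)

    @classmethod
    def from_mapping(cls, data: Mapping[str, str]) -> "CerebrasConfigSketch":
        # Reject keys that the table does not document.
        unknown = set(data) - {"base_url", "api_key"}
        if unknown:
            raise ValueError(f"unknown fields: {sorted(unknown)}")
        # Assumption: an empty string means that no key was supplied.
        api_key = data.get("api_key") or None
        return cls(base_url=data.get("base_url", DEFAULT_BASE_URL), api_key=api_key)
```

Both fields are optional, so `CerebrasConfigSketch.from_mapping({})` yields the default base URL and no API key.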

## Sample Configuration

```yaml
base_url: https://api.cerebras.ai
api_key: ${env.CEREBRAS_API_KEY:=}
```
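
The `${env.CEREBRAS_API_KEY:=}` placeholder in the sample reads as "use the `CEREBRAS_API_KEY` environment variable, falling back to the text after `:=`" (here, an empty string). The sketch below illustrates that substitution idea with the Python standard library. `resolve_env` is a hypothetical helper, not the stack's own resolver, and it assumes the default applies only when the variable is unset.

```python
import os
import re
from typing import Mapping, Optional

# Matches ${env.NAME} and ${env.NAME:=default}; the default may be empty.
_ENV_PATTERN = re.compile(r"\$\{env\.([A-Za-z_][A-Za-z0-9_]*)(?::=([^}]*))?\}")


def resolve_env(text: str, environ: Optional[Mapping[str, str]] = None) -> str:
    """Replace ${env.NAME:=default} placeholders in text (illustrative only)."""
    env = os.environ if environ is None else environ

    def _substitute(match: "re.Match[str]") -> str:
        name, default = match.group(1), match.group(2)
        if name in env:
            return env[name]
        if default is None:
            # No ':=' fallback was given and the variable is not set.
            raise KeyError(f"environment variable {name} is not set")
        return default

    return _ENV_PATTERN.sub(_substitute, text)
```

With this sketch, `resolve_env("api_key: ${env.CEREBRAS_API_KEY:=}", {})` leaves the key empty, while passing `{"CEREBRAS_API_KEY": "abc"}` substitutes `abc`.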