mirror of
https://github.com/meta-llama/llama-stack.git
synced 2025-10-04 04:04:14 +00:00
- update Cerebras to use OpenAIMixin
- enable openai completions tests
- enable openai chat completions tests
- disable with n > 1 tests
- add recording for --setup cerebras --subdirs inference --pattern openai

test with: `./scripts/integration-tests.sh --stack-config server:ci-tests --setup cerebras --subdirs inference --pattern openai`
remote::cerebras
Description
Cerebras inference provider for running models on the Cerebras Cloud platform.
Configuration
| Field | Type | Required | Default | Description |
|---|---|---|---|---|
| `base_url` | `<class 'str'>` | No | https://api.cerebras.ai | Base URL for the Cerebras API |
| `api_key` | `<class 'pydantic.types.SecretStr'>` | No | | Cerebras API Key |
Sample Configuration
base_url: https://api.cerebras.ai
api_key: ${env.CEREBRAS_API_KEY:=}
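
The `${env.CEREBRAS_API_KEY:=}` placeholder in the sample above reads the key from the environment and falls back to the text after `:=` (here, an empty string) when the variable is not set. The following is a minimal Python sketch of that substitution, not the llama-stack implementation: the names `resolve_placeholders` and `resolve_config` are hypothetical, only the `:=` default form is handled, and treating a set-but-empty variable as a valid value is an assumption.

```python
import os
import re

# Matches ${env.NAME:=default}; the default part may be empty.
_PLACEHOLDER = re.compile(r"\$\{env\.([A-Za-z_][A-Za-z0-9_]*):=([^}]*)\}")


def resolve_placeholders(value, environ=None):
    """Replace each ${env.NAME:=default} with the variable's value or its default.

    Hypothetical helper for illustration only.
    """
    env = os.environ if environ is None else environ

    def substitute(match):
        name, default = match.group(1), match.group(2)
        # Assumption: a variable that is set (even to "") wins over the default.
        return env.get(name, default)

    return _PLACEHOLDER.sub(substitute, value)


def resolve_config(config, environ=None):
    """Resolve placeholders in every string value of a flat config mapping."""
    return {key: resolve_placeholders(val, environ) for key, val in config.items()}


# The sample configuration from this page, written as a plain dict.
sample_config = {
    "base_url": "https://api.cerebras.ai",
    "api_key": "${env.CEREBRAS_API_KEY:=}",
}

if __name__ == "__main__":
    print(resolve_config(sample_config))
```

With `CEREBRAS_API_KEY` unset, `api_key` resolves to an empty string, which is consistent with the field being marked as not required in the table above.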