mirror of https://github.com/meta-llama/llama-stack.git synced 2025-07-27 06:28:50 +00:00

Kelly Brown 04659df79d docs: fix warnings in documentation generation

2025-07-23 10:26:13 -04:00

908 B

Raw Blame History

orphan
true

remote::nvidia

Description

NVIDIA inference provider for accessing NVIDIA NIM models and AI services.

Configuration

Field	Type	Required	Default	Description
`url`	`<class 'str'>`	No	https://integrate.api.nvidia.com	A base url for accessing the NVIDIA NIM
`api_key`	`pydantic.types.SecretStr \| None`	No		The NVIDIA API key, only needed of using the hosted service
`timeout`	`<class 'int'>`	No	60	Timeout for the HTTP requests
`append_api_version`	`<class 'bool'>`	No	True	When set to false, the API version will not be appended to the base_url. By default, it is true.

Sample Configuration

url: ${env.NVIDIA_BASE_URL:=https://integrate.api.nvidia.com}
api_key: ${env.NVIDIA_API_KEY:=}
append_api_version: ${env.NVIDIA_APPEND_API_VERSION:=True}

908 B Raw Blame History

remote::nvidia

Description

Configuration

Sample Configuration

908 B

Raw Blame History