docs: fix the docs for NVIDIA Inference Provider (#3055)

# What does this PR do? Fix the NVIDIA inference docs by updating API methods, model IDs, and embedding example. ## Test Plan N/A
2025-12-03 09:53:45 +00:00 · 2025-08-08 02:27:55 -07:00 · 2025-08-08 02:27:55 -07:00 · 9e78f2da96
commit 9e78f2da96
parent e90fe25890
3 changed files with 11 additions and 9 deletions
--- a/docs/source/distributions/self_hosted_distro/nvidia.md
+++ b/docs/source/distributions/self_hosted_distro/nvidia.md
@ -157,7 +157,7 @@ docker run \
 If you've set up your local development environment, you can also build the image using your local virtual environment.

 ```bash
-INFERENCE_MODEL=meta-llama/Llama-3.1-8b-Instruct
+INFERENCE_MODEL=meta-llama/Llama-3.1-8B-Instruct
 llama stack build --distro nvidia --image-type venv
 llama stack run ./run.yaml \
  --port 8321 \