From 931ee14a88def1444b71ad79d3871d97e47f1619 Mon Sep 17 00:00:00 2001 From: Jash Gulabrai Date: Wed, 16 Apr 2025 13:08:03 -0400 Subject: [PATCH] Add reference to e2e notebook --- docs/source/distributions/remote_hosted_distro/nvidia.md | 4 ++++ llama_stack/templates/nvidia/doc_template.md | 4 ++++ 2 files changed, 8 insertions(+) diff --git a/docs/source/distributions/remote_hosted_distro/nvidia.md b/docs/source/distributions/remote_hosted_distro/nvidia.md index 5c5a1ec17..7f0cb82d6 100644 --- a/docs/source/distributions/remote_hosted_distro/nvidia.md +++ b/docs/source/distributions/remote_hosted_distro/nvidia.md @@ -170,3 +170,7 @@ llama stack run ./run.yaml \ --env NVIDIA_API_KEY=$NVIDIA_API_KEY \ --env INFERENCE_MODEL=$INFERENCE_MODEL ``` + +### Example Notebooks +You can reference the Jupyter notebooks in `docs/notebooks/nvidia/` for example usage of these APIs. +- [Llama_Stack_NVIDIA_E2E_Flow.ipynb](/docs/notebooks/nvidia/Llama_Stack_NVIDIA_E2E_Flow.ipynb) contains an end-to-end workflow for running inference, customizing, and evaluating models using your deployed NeMo Microservices platform. diff --git a/llama_stack/templates/nvidia/doc_template.md b/llama_stack/templates/nvidia/doc_template.md index 49120eb1d..ab14c2966 100644 --- a/llama_stack/templates/nvidia/doc_template.md +++ b/llama_stack/templates/nvidia/doc_template.md @@ -144,3 +144,7 @@ llama stack run ./run.yaml \ --env NVIDIA_API_KEY=$NVIDIA_API_KEY \ --env INFERENCE_MODEL=$INFERENCE_MODEL ``` + +### Example Notebooks +You can reference the Jupyter notebooks in `docs/notebooks/nvidia/` for example usage of these APIs. +- [Llama_Stack_NVIDIA_E2E_Flow.ipynb](/docs/notebooks/nvidia/Llama_Stack_NVIDIA_E2E_Flow.ipynb) contains an end-to-end workflow for running inference, customizing, and evaluating models using your deployed NeMo Microservices platform.