llama-stack-mirror/docs/source/distributions/self_hosted_distro
Jash Gulabrai 40e2c97915
feat: Add Nvidia e2e beginner notebook and tool calling notebook (#1964)
# What does this PR do?
This PR contains two sets of notebooks that serve as reference material
for developers getting started with Llama Stack using the NVIDIA
Provider. Developers should be able to execute these notebooks
end-to-end, pointing to their NeMo Microservices deployment.
1. `beginner_e2e/`: Notebook that walks through a beginner end-to-end
workflow that covers creating datasets, running inference, customizing
and evaluating models, and running safety checks.
2. `tool_calling/`: Notebook that is ported over from the [Data Flywheel
& Tool Calling
notebook](https://github.com/NVIDIA/GenerativeAIExamples/tree/main/nemo/data-flywheel)
that is referenced in the NeMo Microservices docs. I updated the
notebook to use the Llama Stack client wherever possible, and added
relevant instructions.

[//]: # (If resolving an issue, uncomment and update the line below)
[//]: # (Closes #[issue-number])

## Test Plan
- Both notebook folders contain READMEs with pre-requisites. To manually
test these notebooks, you'll need to have a deployment of the NeMo
Microservices Platform and update the `config.py` file with your
deployment's information.
- I've run through these notebooks manually end-to-end to verify each
step works.

[//]: # (## Documentation)

---------

Co-authored-by: Jash Gulabrai <jgulabrai@nvidia.com>
2025-06-16 11:29:01 -04:00
..
bedrock.md fix: remove code interpeter implementation (#2087) 2025-05-01 14:35:08 -07:00
cerebras.md fix: replace all instances of --yaml-config with --config (#2196) 2025-05-16 14:31:12 -07:00
dell-tgi.md fix: docker run with --pull always to fetch the latest image (#1733) 2025-03-20 15:35:48 -07:00
dell.md fix: replace all instances of --yaml-config with --config (#2196) 2025-05-16 14:31:12 -07:00
fireworks.md feat: reference implementation for files API (#2330) 2025-06-02 21:54:24 -07:00
groq.md fix: remove code interpeter implementation (#2087) 2025-05-01 14:35:08 -07:00
meta-reference-gpu.md fix: remove code interpeter implementation (#2087) 2025-05-01 14:35:08 -07:00
nvidia.md feat: Add Nvidia e2e beginner notebook and tool calling notebook (#1964) 2025-06-16 11:29:01 -04:00
ollama.md feat: File search tool for Responses API (#2426) 2025-06-13 14:32:48 -04:00
passthrough.md fix: remove code interpeter implementation (#2087) 2025-05-01 14:35:08 -07:00
remote-vllm.md fix: replace all instances of --yaml-config with --config (#2196) 2025-05-16 14:31:12 -07:00
sambanova.md feat(providers): sambanova safety provider (#2221) 2025-05-21 15:33:02 -07:00
tgi.md fix: replace all instances of --yaml-config with --config (#2196) 2025-05-16 14:31:12 -07:00
together.md fix: revert "feat(provider): adding llama4 support in together inference provider (#2123)" (#2124) 2025-05-08 15:18:16 -07:00