llama-stack-mirror

Mirror of https://github.com/meta-llama/llama-stack.git, synced 2025-07-13 16:46:09 +00:00

Ben Browning 5bb3817c49 fix: Restore the nvidia distro (#2639)	2025-07-07 15:50:05 -07:00

# What does this PR do?

The `nvidia` distro was previously collapsed into the `starter` distro. However, the `nvidia` distro was set up specifically to use NVIDIA NeMo microservices as providers for all APIs and not just inference, which means it was doing quite a bit more than what the `starter` distro covers today.

We should work with our friends at NVIDIA to determine the best place to maintain this distro long-term, but for now this restores the `nvidia` distro and its docs back to where they were so that things continue to work for their users.

## Test Plan

I ensured the `nvidia` distro could build and run, at least to the point of complaining that I didn't provide the necessary API keys.

```
uv run llama stack build --template nvidia --image-type venv
uv run llama stack run llama_stack/templates/nvidia/run.yaml
```

I also made sure the docs website built and looks reasonable, with the `nvidia` distro docs at the same URL as before (because it has incoming links from official NVIDIA NeMo docs, among other places).

```
uv run --group docs sphinx-autobuild docs/source docs/build/html --write-all
```

Signed-off-by: Ben Browning <bbrownin@redhat.com>
building_applications	feat: improve telemetry (#2590)	2025-07-04 17:29:09 +02:00
concepts	docs: specify the ability to train non-Llama models (#2573)	2025-07-01 19:29:06 +05:30
contributing	docs: revamp testing documentation (#2155)	2025-05-13 11:28:29 -07:00
distributions	fix: Restore the nvidia distro (#2639)	2025-07-07 15:50:05 -07:00
getting_started	refactor: set proper name for embedding all-minilm:l6-v2 and update to use "starter" in detailed_tutorial (#2627)	2025-07-06 09:07:37 +05:30
introduction	docs: Remove mentions of focus on Llama models (#1690)	2025-03-19 00:17:22 -04:00
openai	docs: Add OpenAI API compatibility page (#2316)	2025-06-04 06:51:52 -04:00
playground	chore: simplify running the demo UI (#1907)	2025-04-09 11:22:29 -07:00
providers	feat: improve telemetry (#2590)	2025-07-04 17:29:09 +02:00
references	chore: remove last instances of code-interpreter provider (#2143)	2025-05-12 10:54:43 -07:00
conf.py	fix: use pypi browser agent (#2260)	2025-05-24 23:26:30 -07:00
index.md	docs: update full list of providers with matched APIs and dockerhub images (#2452)	2025-07-03 10:12:56 +02:00