Mirror of https://github.com/meta-llama/llama-stack.git (synced 2025-07-12 16:16:09 +00:00)
fix broken --list-templates by adding build.yaml files for packaging (#327)
* add build files to templates
* fix templates
* manifest
* symlink
* symlink
* precommit
* change everything to docker build.yaml
* remove image_type in templates
* fix build from templates CLI
* fix readmes
This commit is contained in:
parent afae4e3d8e
commit 07f9bf723f

32 changed files with 161 additions and 158 deletions
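The breakage this commit repairs is the template listing in the build CLI, which now resolves against the build.yaml files added for packaging. A minimal usage sketch (only `--list-templates` is named by the commit itself; the `--template` flag and the template name below are assumptions for illustration):

```bash
# List the packaged templates; after this fix each entry is backed by a
# build.yaml file shipped with the package.
llama stack build --list-templates

# Build from one of the listed templates (flag and name are illustrative;
# pick a template from the listing above).
llama stack build --template local-gpu
```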
@@ -35,11 +35,7 @@ You have two ways to start up Llama stack server:
 1. **Starting up server via docker**:
 
-We provide 2 pre-built Docker image of Llama Stack distribution, which can be found in the following links.
-- [llamastack-local-gpu](https://hub.docker.com/repository/docker/llamastack/llamastack-local-gpu/general)
-  - This is a packaged version with our local meta-reference implementations, where you will be running inference locally with downloaded Llama model checkpoints.
-- [llamastack-local-cpu](https://hub.docker.com/repository/docker/llamastack/llamastack-local-cpu/general)
-  - This is a lite version with remote inference where you can hook up to your favourite remote inference framework (e.g. ollama, fireworks, together, tgi) for running inference without GPU.
+We provide pre-built Docker image of Llama Stack distribution, which can be found in the following links in the [distributions](../distributions/) folder.
 
 > [!NOTE]
 > For GPU inference, you need to set these environment variables for specifying local directory containing your model checkpoints, and enable GPU inference to start running docker container.
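For the GPU note kept in this hunk, a hedged sketch of what such an invocation might look like. The image name comes from the Docker Hub link in the removed lines; the environment variable name, port, and mount path are assumptions for illustration:

```bash
# Point the container at local model checkpoints and enable GPU inference.
# LLAMA_CHECKPOINT_DIR, the port, and the mount path are illustrative;
# the image name matches the Docker Hub link above.
export LLAMA_CHECKPOINT_DIR=~/.llama

docker run -it \
  --gpus=all \
  -p 5000:5000 \
  -v "$LLAMA_CHECKPOINT_DIR":/root/.llama \
  llamastack/llamastack-local-gpu
```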