Merge branch 'evals_6' into evals_7

Xi Yan 2024-10-25 12:55:51 -07:00
commit 575e51eb76
51 changed files with 448 additions and 420 deletions


@@ -279,11 +279,11 @@ llama stack build --list-templates
You may then pick a template to build your distribution with providers fitted to your liking.
```
-llama stack build --template local-tgi --name my-tgi-stack
+llama stack build --template local-tgi --name my-tgi-stack --image-type conda
```
```
-$ llama stack build --template local-tgi --name my-tgi-stack
+$ llama stack build --template local-tgi --name my-tgi-stack --image-type conda
...
...
Build spec configuration saved at ~/.conda/envs/llamastack-my-tgi-stack/my-tgi-stack-build.yaml
@@ -293,10 +293,10 @@ You may now run `llama stack configure my-tgi-stack` or `llama stack configure ~
#### Building from config file
- In addition to templates, you may customize the build to your liking by editing a config file and building from it with the following command.
-- The config file will be of contents like the ones in `llama_stack/distributions/templates/`.
+- The config file will have contents like the ones in `llama_stack/templates/`.
```
-$ cat llama_stack/distribution/templates/local-ollama-build.yaml
+$ cat build.yaml
name: local-ollama
distribution_spec:
@@ -311,7 +311,7 @@ image_type: conda
```
```
-llama stack build --config llama_stack/distribution/templates/local-ollama-build.yaml
+llama stack build --config build.yaml
```
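For reference, a complete `build.yaml` in this format might look like the sketch below. Only `name`, `distribution_spec:`, and `image_type: conda` are visible in the excerpt above; the description and provider entries are assumptions for illustration, not a verbatim copy of the repository file.
```yaml
name: local-ollama
distribution_spec:
  description: Use ollama for running LLM inference   # description text is illustrative
  providers:                                           # provider list is an assumption
    inference: remote::ollama
    memory: meta-reference
    safety: meta-reference
    agents: meta-reference
    telemetry: meta-reference
image_type: conda
```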
#### How to build distribution with Docker image
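The body of this section is not shown in the diff. As a rough sketch, assuming the CLI accepts a Docker image type the same way it accepts `conda` in the examples above (the flag value is an assumption, not taken from the excerpt), a Docker-based build would only swap the image type:
```
llama stack build --template local-tgi --name my-tgi-stack --image-type docker
```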


@@ -35,11 +35,7 @@ You have two ways to start up Llama stack server:
1. **Starting up server via docker**:
-We provide 2 pre-built Docker image of Llama Stack distribution, which can be found in the following links.
-- [llamastack-local-gpu](https://hub.docker.com/repository/docker/llamastack/llamastack-local-gpu/general)
-  - This is a packaged version with our local meta-reference implementations, where you will be running inference locally with downloaded Llama model checkpoints.
-- [llamastack-local-cpu](https://hub.docker.com/repository/docker/llamastack/llamastack-local-cpu/general)
-  - This is a lite version with remote inference where you can hook up to your favourite remote inference framework (e.g. ollama, fireworks, together, tgi) for running inference without GPU.
+We provide pre-built Docker images of the Llama Stack distribution, which can be found via the links in the [distributions](../distributions/) folder.
> [!NOTE]
> For GPU inference, you need to set these environment variables to specify the local directory containing your model checkpoints, and to enable GPU inference when starting the Docker container.
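As an illustration of that note, launching the GPU container might look like the sketch below; the image name, port, mount path, and variable name are assumptions rather than values taken from the excerpt.
```
# Illustrative only: point the container at local checkpoints and expose the GPUs
export LLAMA_CHECKPOINT_DIR=~/.llama
docker run -it -p 5000:5000 \
  -v $LLAMA_CHECKPOINT_DIR:/root/.llama \
  --gpus=all \
  llamastack/llamastack-local-gpu
```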