Add Kubernetes deployment guide (#899)

This PR moves some content from [the recent blog
post](https://blog.vllm.ai/2025/01/27/intro-to-llama-stack-with-vllm.html)
into the documentation as a more official guide for users who'd like to deploy Llama
Stack on Kubernetes.
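
For context, the deployment flow the new guide describes boils down to running a Llama Stack container image behind a Kubernetes `Deployment` and `Service`. The sketch below is illustrative only: the namespace, the image name `my-llama-stack:latest`, and port `8321` are placeholder assumptions rather than values taken from the guide, so substitute whatever your own build and `run.yaml` use.

```bash
# Illustrative sketch only -- namespace, image name, and port are assumptions.
kubectl create namespace llama-stack

cat <<EOF | kubectl apply -n llama-stack -f -
apiVersion: apps/v1
kind: Deployment
metadata:
  name: llama-stack-server
spec:
  replicas: 1
  selector:
    matchLabels:
      app: llama-stack
  template:
    metadata:
      labels:
        app: llama-stack
    spec:
      containers:
      - name: llama-stack
        image: my-llama-stack:latest   # placeholder: your built Llama Stack image
        ports:
        - containerPort: 8321          # placeholder: the port your run.yaml exposes
---
apiVersion: v1
kind: Service
metadata:
  name: llama-stack-service
spec:
  selector:
    app: llama-stack
  ports:
  - port: 8321
    targetPort: 8321
EOF
```

Once the pods are ready, in-cluster clients can reach the server at `http://llama-stack-service.llama-stack.svc.cluster.local:8321`; the guide itself covers the full setup from the blog post, which pairs Llama Stack with vLLM for inference.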

---------

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>
Yuan Tang authored on 2025-02-06 13:28:02 -05:00, committed by GitHub
commit 09ed0e9c9f (parent a25e3b405c)
2 changed files with 214 additions and 1 deletion


@@ -14,7 +14,12 @@ Another simple way to start interacting with Llama Stack is to just spin up a co
 **Conda**:
-Lastly, if you have a custom or an advanced setup or you are developing on Llama Stack, you can also build a custom Llama Stack server. Using `llama stack build` and `llama stack run` you can build/run a custom Llama Stack server containing the exact combination of providers you wish. We have also provided various templates to make getting started easier. See [Building a Custom Distribution](building_distro) for more details.
+If you have a custom or an advanced setup or you are developing on Llama Stack, you can also build a custom Llama Stack server. Using `llama stack build` and `llama stack run` you can build/run a custom Llama Stack server containing the exact combination of providers you wish. We have also provided various templates to make getting started easier. See [Building a Custom Distribution](building_distro) for more details.
+
+**Kubernetes**:
+
+If you have built a container image and want to deploy it in a Kubernetes cluster instead of starting the Llama Stack server locally, see the [Kubernetes Deployment Guide](kubernetes_deployment) for more details.
 ```{toctree}
@@ -25,4 +30,5 @@ importing_as_library
 building_distro
 configuration
 selection
+kubernetes_deployment
 ```
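
The first hunk above points users with custom setups at `llama stack build` and `llama stack run`. A minimal sketch of that flow is below; the template name, flags, and paths are assumptions about a typical invocation rather than commands copied from the guide, and they vary across Llama Stack versions, so check `llama stack build --help` for the options in your installation.

```bash
# Hypothetical sketch of building and running a custom distribution.
# Template name, flags, and paths are assumptions -- verify against
# `llama stack build --help` for your installed version.

# Build a distribution from a template as a container image.
llama stack build --template remote-vllm --image-type container

# Run the resulting server locally; `llama stack build` prints the actual
# location of the generated run.yaml (the path below is illustrative).
llama stack run ~/.llama/distributions/remote-vllm/remote-vllm-run.yaml --port 8321
```

The same container image can then be pushed to a registry and referenced from Kubernetes manifests along the lines of the sketch in the PR description above, which is the scenario the new [Kubernetes Deployment Guide](kubernetes_deployment) addresses.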