From 6aa459b00c55c31bcd265c6876bdb0f6f1d70123 Mon Sep 17 00:00:00 2001 From: Mark Campbell Date: Fri, 11 Apr 2025 12:04:13 +0100 Subject: [PATCH] docs: fix errors in kubernetes deployment guide (#1914) # What does this PR do? [Provide a short summary of what this PR does and why. Link to relevant issues if applicable.] Fixes a couple of errors in PVC/Secret setup and adds context for expected Hugging Face token [//]: # (If resolving an issue, uncomment and update the line below) [//]: # (Closes #[issue-number]) ## Test Plan [Describe the tests you ran to verify your changes with result summaries. *Provide clear instructions so the plan can be easily re-executed.*] [//]: # (## Documentation) --- docs/source/distributions/kubernetes_deployment.md | 14 ++++++++++---- 1 file changed, 10 insertions(+), 4 deletions(-) diff --git a/docs/source/distributions/kubernetes_deployment.md b/docs/source/distributions/kubernetes_deployment.md index 2daf9d785..21ec02012 100644 --- a/docs/source/distributions/kubernetes_deployment.md +++ b/docs/source/distributions/kubernetes_deployment.md @@ -11,7 +11,12 @@ First, create a local Kubernetes cluster via Kind: kind create cluster --image kindest/node:v1.32.0 --name llama-stack-test ``` -First, create a Kubernetes PVC and Secret for downloading and storing Hugging Face model: +First set your hugging face token as an environment variable. +``` +export HF_TOKEN=$(echo -n "your-hf-token" | base64) +``` + +Now create a Kubernetes PVC and Secret for downloading and storing Hugging Face model: ``` cat </tmp/test-vllm-llama-stack/Containerfile.llama-stack-run-k8s <$tmp_dir/Containerfile.llama-stack-run-k8s <