llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-06-27 18:50:41 +00:00

Ashwin Bharambe 7fb4bdabea docs(kubernetes): add more fleshed-out example of a Demo Kubernetes cluster (#2329)	2025-06-02 13:07:08 -07:00

This Kubernetes cluster has:
- vLLM for serving an inference model
- vLLM for serving a safety model
- Postgres DB (for metadata and other state for the Llama Stack distro)
- Chroma DB for Vector IO (memory)

Perhaps most importantly, this was me trying to learn Kubernetes for the first time.

## Test Plan

Run `sh apply.sh` against an EKS cluster, then after `kubectl port-forward service/llama-stack-service 8321:8321` and after many attempts, we have finally:

<img width="1589" alt="image" src="https://github.com/user-attachments/assets/c69f242d-6aaa-4def-9f7c-172113b8bfc1" />
<img width="1978" alt="image" src="https://github.com/user-attachments/assets/cf678404-f551-4fa5-9077-bebe3e8e8ae8" />
building_applications	feat: Enable ingestion of precomputed embeddings (#2317)	2025-05-31 04:03:37 -06:00
concepts	docs: fix typos in evaluation concepts (#1745)	2025-03-21 12:00:53 -07:00
contributing	docs: revamp testing documentation (#2155)	2025-05-13 11:28:29 -07:00
distributions	docs(kubernetes): add more fleshed-out example of a Demo Kubernetes cluster (#2329)	2025-06-02 13:07:08 -07:00
getting_started	docs: Remove datasets.rst and fix llama-stack build commands (#2061)	2025-05-06 09:51:20 -07:00
introduction	docs: Remove mentions of focus on Llama models (#1690)	2025-03-19 00:17:22 -04:00
playground	chore: simplify running the demo UI (#1907)	2025-04-09 11:22:29 -07:00
providers	docs: add post training to providers list (#2280)	2025-05-28 09:32:00 -04:00
references	chore: remove last instances of code-interpreter provider (#2143)	2025-05-12 10:54:43 -07:00
conf.py	fix: use pypi browser agent (#2260)	2025-05-24 23:26:30 -07:00
index.md	docs: fixes to quick start (#1943)	2025-04-11 13:41:23 -07:00