llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-10-05 12:21:52 +00:00

History

Ashwin Bharambe 7fb4bdabea docs(kubernetes): add more fleshed-out example of a Demo Kubernetes cluster (#2329 ) This Kubernetes cluster has: - vLLM for serving an inference model - vLLM for serving a safety model - Postgres DB (for metadata and other state for the Llama Stack distro) - Chroma DB for Vector IO (memory) Perhaps most importantly, this was me trying to learn Kubernetes for the first time. ## Test Plan Run `sh apply.sh` against an EKS cluster, then after `kubectl port-forward service/llama-stack-service 8321:8321` and after many attempts, we have finally: <img width="1589" alt="image" src="https://github.com/user-attachments/assets/c69f242d-6aaa-4def-9f7c-172113b8bfc1" /> <img width="1978" alt="image" src="https://github.com/user-attachments/assets/cf678404-f551-4fa5-9077-bebe3e8e8ae8" />		2025-06-02 13:07:08 -07:00
..
k8s	docs(kubernetes): add more fleshed-out example of a Demo Kubernetes cluster (#2329 )	2025-06-02 13:07:08 -07:00
ondevice_distro	docs: 0.2.2 doc updates (#1961 )	2025-04-15 13:26:17 -07:00
remote_hosted_distro	fix: replace all instances of --yaml-config with --config (#2196 )	2025-05-16 14:31:12 -07:00
self_hosted_distro	feat(providers): sambanova safety provider (#2221 )	2025-05-21 15:33:02 -07:00
building_distro.md	refactor: remove container from list of run image types (#2178 )	2025-06-02 09:57:55 +02:00
configuration.md	chore: clarify cache_ttl to be key_recheck_period (#2220 )	2025-05-21 17:30:23 +02:00
importing_as_library.md	docs: update importing_as_library.md (#1863 )	2025-04-07 12:31:04 +02:00
index.md	docs: Updated documentation and Sphinx configuration (#1845 )	2025-03-31 13:08:05 -07:00
kubernetes_deployment.md	fix: replace all instances of --yaml-config with --config (#2196 )	2025-05-16 14:31:12 -07:00
list_of_distributions.md	docs: Updated documentation and Sphinx configuration (#1845 )	2025-03-31 13:08:05 -07:00
starting_llama_stack_server.md	docs: Update quickstart page to structure things a little more for the novices (#1873 )	2025-04-10 14:09:00 -07:00