llama-stack-mirror/docs/source
ehhuang d6ae54723d
chore: setup for performance benchmarking (#3096)
# What does this PR do?
1. Added a simple mock openai-compat server that serves chat/completion
2. Add a benchmark server in EKS that includes mock inference server
3. Add locust (https://locust.io/) file for load testing

## Test Plan
bash apply.sh
kubectl port-forward service/locust-web-ui 8089:8089
Go to localhost:8089 to start a load test

<img width="1392" height="334" alt="image"
src="https://github.com/user-attachments/assets/d6aa3deb-583a-42ed-889b-751262b8e91c"
/>
<img width="1362" height="881" alt="image"
src="https://github.com/user-attachments/assets/6a28b9b4-05e6-44e2-b504-07e60c12d35e"
/>
2025-08-13 10:58:22 -07:00
..
advanced_apis chore: rename templates to distributions (#3035) 2025-08-04 11:34:17 -07:00
apis chore: bump min python version in docs and tests (#3103) 2025-08-12 08:52:57 -07:00
building_applications docs: Update blocks formatting in docs/source files (#3120) 2025-08-13 08:06:31 -07:00
concepts docs: update list of apis (#2697) 2025-07-24 09:50:14 -07:00
contributing docs: reorganize contributor guide (#3110) 2025-08-12 16:17:03 -07:00
deploying chore(rename): move llama_stack.distribution to llama_stack.core (#2975) 2025-07-30 23:30:53 -07:00
distributions chore: setup for performance benchmarking (#3096) 2025-08-13 10:58:22 -07:00
getting_started docs: Added comment about a known limitation of AgentEventLogger (#2930) 2025-08-07 10:09:57 -07:00
providers docs: Update blocks formatting in docs/source files (#3120) 2025-08-13 08:06:31 -07:00
references docs: Update blocks formatting in docs/source files (#3120) 2025-08-13 08:06:31 -07:00
conf.py docs: Add documentation on how to contribute a Vector DB provider and update testing documentation (#3093) 2025-08-11 11:11:09 -07:00
index.md docs: Reorganize documentation on the webpage (#2651) 2025-07-15 14:19:35 -07:00