## What does this PR do?

- Provides a distro template that lets developers easily run the open benchmarks Llama Stack supports on both Llama and non-Llama models.
- Provides documentation on how to run open benchmark evals via the CLI, along with an open benchmark contributing guide.

Closes #1375

## Test Plan

Open benchmark eval results on Llama, GPT, Gemini, and Claude:

<img width="771" alt="Screenshot 2025-03-06 at 7 33 05 PM" src="https://github.com/user-attachments/assets/1bd85456-b9b9-4b37-af76-4ce1d2bac00e" />

Doc preview:

<img width="944" alt="Screenshot 2025-03-06 at 7 33 58 PM" src="https://github.com/user-attachments/assets/f4e5866d-b395-4c40-aa8b-080edeb5cdb6" />
<img width="955" alt="Screenshot 2025-03-06 at 7 34 04 PM" src="https://github.com/user-attachments/assets/629defb6-d5e4-473c-aa03-308bce386fb4" />
<img width="965" alt="Screenshot 2025-03-06 at 7 35 29 PM" src="https://github.com/user-attachments/assets/c21ff96c-9e8c-4c54-b6b8-25883125f4cf" />
<img width="957" alt="Screenshot 2025-03-06 at 7 35 37 PM" src="https://github.com/user-attachments/assets/47571c90-1381-4e2c-bbed-c4f3a60578d0" />