llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-03 09:53:45 +00:00


Xi Yan 8b655e3cd2 fix!: update eval-tasks -> benchmarks (#1032)	2025-02-13 16:40:58 -08:00

# What does this PR do?

- Update `/eval-tasks` to `/benchmarks`
- ⚠️ Remove the differentiation between `app` vs. `benchmark` eval task config. Now we only have `BenchmarkConfig`. The overloaded `benchmark` is confusing and does not add any value. Backward compatibility is kept because the "type" is not used anywhere.

## Test Plan

- This change is backward compatible
- Run notebook test with

```
pytest -v -s --nbval-lax ./docs/getting_started.ipynb
pytest -v -s --nbval-lax ./docs/notebooks/Llama_Stack_Benchmark_Evals.ipynb
```

Signed-off-by: Ihar Hrachyshka <ihar.hrachyshka@gmail.com>
Signed-off-by: Ben Browning <bbrownin@redhat.com>
Signed-off-by: Sébastien Han <seb@redhat.com>
Signed-off-by: reidliu <reid201711@gmail.com>
Co-authored-by: Ihar Hrachyshka <ihar.hrachyshka@gmail.com>
Co-authored-by: Ben Browning <ben324@gmail.com>
Co-authored-by: Sébastien Han <seb@redhat.com>
Co-authored-by: Reid <61492567+reidliu41@users.noreply.github.com>
Co-authored-by: reidliu <reid201711@gmail.com>
Co-authored-by: Yuan Tang <terrytangyuan@gmail.com>
agents	build: format codebase imports using ruff linter (#1028)	2025-02-13 10:06:21 -08:00
batch_inference	Update OpenAPI generator to add param and field documentation (#896)	2025-01-29 10:04:30 -08:00
benchmarks	fix!: update eval-tasks -> benchmarks (#1032)	2025-02-13 16:40:58 -08:00
common	build: format codebase imports using ruff linter (#1028)	2025-02-13 10:06:21 -08:00
datasetio	Fix precommit check after moving to ruff (#927)	2025-02-02 06:46:45 -08:00
datasets	fix: make backslash work in GET /models/{model_id:path} (#1068)	2025-02-13 08:46:43 -08:00
eval	fix!: update eval-tasks -> benchmarks (#1032)	2025-02-13 16:40:58 -08:00
inference	feat: add MetricResponseMixin to chat completion response types (#1050)	2025-02-11 14:58:12 -08:00
inspect	REST API fixes (#789)	2025-01-16 13:47:08 -08:00
models	fix: make backslash work in GET /models/{model_id:path} (#1068)	2025-02-13 08:46:43 -08:00
post_training	Fix precommit check after moving to ruff (#927)	2025-02-02 06:46:45 -08:00
safety	More idiomatic REST API (#765)	2025-01-15 13:20:09 -08:00
scoring	More idiomatic REST API (#765)	2025-01-15 13:20:09 -08:00
scoring_functions	build: format codebase imports using ruff linter (#1028)	2025-02-13 10:06:21 -08:00
shields	fix: make backslash work in GET /models/{model_id:path} (#1068)	2025-02-13 08:46:43 -08:00
synthetic_data_generation	build: format codebase imports using ruff linter (#1028)	2025-02-13 10:06:21 -08:00
telemetry	chore: Consistent naming for VectorIO providers (#1023)	2025-02-13 13:15:49 -05:00
tools	build: format codebase imports using ruff linter (#1028)	2025-02-13 10:06:21 -08:00
vector_dbs	fix: make backslash work in GET /models/{model_id:path} (#1068)	2025-02-13 08:46:43 -08:00
vector_io	[memory refactor][6/n] Update naming and routes (#839)	2025-01-22 10:39:13 -08:00
__init__.py	API Updates (#73)	2024-09-17 19:51:35 -07:00
datatypes.py	fix!: update eval-tasks -> benchmarks (#1032)	2025-02-13 16:40:58 -08:00
resource.py	fix!: update eval-tasks -> benchmarks (#1032)	2025-02-13 16:40:58 -08:00
version.py	llama-stack version alpha -> v1	2025-01-15 05:58:09 -08:00