llama-stack-mirror/llama_stack/apis
Xi Yan 8b655e3cd2
fix!: update eval-tasks -> benchmarks (#1032)
# What does this PR do?

- Update `/eval-tasks` to `/benchmarks`
- ⚠️ Remove the distinction between `app` and `benchmark` eval task
configs. There is now only `BenchmarkConfig`. The overloaded `benchmark`
type was confusing and did not add any value. Backward compatibility is
preserved because the "type" field is not used anywhere.
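
For callers still holding legacy paths or configs, the rename can be sketched roughly as below. This is a minimal illustration, not code from this PR: the helper names `migrate_route` and `migrate_config`, the `/v1` prefix, and the `num_examples` field are assumptions made for the example. Only the `eval-tasks` -> `benchmarks` rename and the unused "type" field come from the description above.

```python
from typing import Any


def migrate_route(path: str) -> str:
    """Rewrite any `eval-tasks` path segment to `benchmarks` (hypothetical helper)."""
    return "/".join(
        "benchmarks" if segment == "eval-tasks" else segment
        for segment in path.split("/")
    )


def migrate_config(config: dict[str, Any]) -> dict[str, Any]:
    """Return a copy of a legacy task config without the unused "type" tag.

    The input dict is left untouched; all other keys pass through unchanged.
    """
    return {key: value for key, value in config.items() if key != "type"}


if __name__ == "__main__":
    # Illustrative values only; the path prefix and field names are assumptions.
    print(migrate_route("/v1/eval-tasks/my-task"))
    print(migrate_config({"type": "benchmark", "num_examples": 5}))
```

Matching whole path segments rather than substrings keeps unrelated routes that merely contain the text `eval-tasks` inside a longer segment from being rewritten.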

## Test Plan
- This change is backward compatible.
- Run the notebook tests with:

```
pytest -v -s --nbval-lax ./docs/getting_started.ipynb
pytest -v -s --nbval-lax ./docs/notebooks/Llama_Stack_Benchmark_Evals.ipynb
```

<img width="846" alt="image"
src="https://github.com/user-attachments/assets/d2fc06a7-593a-444f-bc1f-10ab9b0c843d"
/>




---------

Signed-off-by: Ihar Hrachyshka <ihar.hrachyshka@gmail.com>
Signed-off-by: Ben Browning <bbrownin@redhat.com>
Signed-off-by: Sébastien Han <seb@redhat.com>
Signed-off-by: reidliu <reid201711@gmail.com>
Co-authored-by: Ihar Hrachyshka <ihar.hrachyshka@gmail.com>
Co-authored-by: Ben Browning <ben324@gmail.com>
Co-authored-by: Sébastien Han <seb@redhat.com>
Co-authored-by: Reid <61492567+reidliu41@users.noreply.github.com>
Co-authored-by: reidliu <reid201711@gmail.com>
Co-authored-by: Yuan Tang <terrytangyuan@gmail.com>
2025-02-13 16:40:58 -08:00

| Name | Last commit | Date |
| --- | --- | --- |
| agents | build: format codebase imports using ruff linter (#1028) | 2025-02-13 10:06:21 -08:00 |
| batch_inference | Update OpenAPI generator to add param and field documentation (#896) | 2025-01-29 10:04:30 -08:00 |
| benchmarks | fix!: update eval-tasks -> benchmarks (#1032) | 2025-02-13 16:40:58 -08:00 |
| common | build: format codebase imports using ruff linter (#1028) | 2025-02-13 10:06:21 -08:00 |
| datasetio | Fix precommit check after moving to ruff (#927) | 2025-02-02 06:46:45 -08:00 |
| datasets | fix: make backslash work in GET /models/{model_id:path} (#1068) | 2025-02-13 08:46:43 -08:00 |
| eval | fix!: update eval-tasks -> benchmarks (#1032) | 2025-02-13 16:40:58 -08:00 |
| inference | feat: add MetricResponseMixin to chat completion response types (#1050) | 2025-02-11 14:58:12 -08:00 |
| inspect | REST API fixes (#789) | 2025-01-16 13:47:08 -08:00 |
| models | fix: make backslash work in GET /models/{model_id:path} (#1068) | 2025-02-13 08:46:43 -08:00 |
| post_training | Fix precommit check after moving to ruff (#927) | 2025-02-02 06:46:45 -08:00 |
| safety | More idiomatic REST API (#765) | 2025-01-15 13:20:09 -08:00 |
| scoring | More idiomatic REST API (#765) | 2025-01-15 13:20:09 -08:00 |
| scoring_functions | build: format codebase imports using ruff linter (#1028) | 2025-02-13 10:06:21 -08:00 |
| shields | fix: make backslash work in GET /models/{model_id:path} (#1068) | 2025-02-13 08:46:43 -08:00 |
| synthetic_data_generation | build: format codebase imports using ruff linter (#1028) | 2025-02-13 10:06:21 -08:00 |
| telemetry | chore: Consistent naming for VectorIO providers (#1023) | 2025-02-13 13:15:49 -05:00 |
| tools | build: format codebase imports using ruff linter (#1028) | 2025-02-13 10:06:21 -08:00 |
| vector_dbs | fix: make backslash work in GET /models/{model_id:path} (#1068) | 2025-02-13 08:46:43 -08:00 |
| vector_io | [memory refactor][6/n] Update naming and routes (#839) | 2025-01-22 10:39:13 -08:00 |
| `__init__.py` | API Updates (#73) | 2024-09-17 19:51:35 -07:00 |
| datatypes.py | fix!: update eval-tasks -> benchmarks (#1032) | 2025-02-13 16:40:58 -08:00 |
| resource.py | fix!: update eval-tasks -> benchmarks (#1032) | 2025-02-13 16:40:58 -08:00 |
| version.py | llama-stack version alpha -> v1 | 2025-01-15 05:58:09 -08:00 |