update EvaluationTask

Xi Yan 2025-03-18 19:30:01 -07:00
parent f107e3229b
commit d994499f09
3 changed files with 32 additions and 5 deletions

@@ -5908,16 +5908,26 @@ components:
      properties:
        benchmark_id:
          type: string
          description: The benchmark ID to evaluate.
        dataset_id:
          type: string
          description: The dataset ID to evaluate.
        data_source:
          $ref: '#/components/schemas/DataSource'
          description: The data source to evaluate.
        grader_ids:
          type: array
          items:
            type: string
          description: The grader IDs to evaluate.
      additionalProperties: false
      title: EvaluationTask
      description: >-
        A task for evaluation. To specify a task, one of the following must be provided:
        - `benchmark_id`: Run evaluation task against a benchmark_id
        - `dataset_id` and `grader_ids`: Run evaluation task against a dataset_id and a list of grader_ids
        - `data_source` and `grader_ids`: Run evaluation task against a data source (e.g. rows, uri, etc.) and a list of grader_ids
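
Note that the schema itself marks no field as required, so the "one of the following must be provided" rule lives only in the description text. A minimal sketch of how a caller might check it is given here, assuming a task arrives as a plain dict; the function names, the mode labels, and the choice to treat empty values as "not provided" are illustrative assumptions and are not part of this commit.

```python
from typing import Any, Dict, List, Tuple

# Field combinations copied from the EvaluationTask description in this diff.
# The mode names ("benchmark", "dataset", "data_source") are hypothetical labels.
TASK_MODES: Dict[str, Tuple[str, ...]] = {
    "benchmark": ("benchmark_id",),
    "dataset": ("dataset_id", "grader_ids"),
    "data_source": ("data_source", "grader_ids"),
}


def matching_modes(task: Dict[str, Any]) -> List[str]:
    """Return every mode whose required fields are all present and non-empty."""
    # Assumption: None, an empty string, or an empty list counts as "not provided".
    return [
        name
        for name, fields in TASK_MODES.items()
        if all(task.get(field) for field in fields)
    ]


def validate_task(task: Dict[str, Any]) -> List[str]:
    """Raise ValueError if the task satisfies none of the documented combinations."""
    modes = matching_modes(task)
    if not modes:
        raise ValueError(
            "EvaluationTask needs benchmark_id, or dataset_id + grader_ids, "
            "or data_source + grader_ids"
        )
    return modes
```

The description does not say whether the three combinations are mutually exclusive, so the sketch reports every combination that matches instead of rejecting tasks that satisfy more than one.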
    GradeRequest:
      type: object
      properties: