Update llama_stack/apis/evaluation/evaluation.py

Co-authored-by: Ashwin Bharambe <ashwin.bharambe@gmail.com>
Xi Yan 2025-03-18 20:16:15 -07:00 committed by GitHub
parent 85cad639ca
commit 820b9a00c7

@@ -53,7 +53,7 @@ class EvaluationTask(BaseModel):
A task for evaluation. To specify a task, one of the following must be provided:
- `benchmark_id`: Run evaluation task against a benchmark_id. Use this when you have a curated dataset and have settled on the graders.
- `dataset_id` and `grader_ids`: Run evaluation task against a dataset_id and a list of grader_ids.
-- `data_source` and `grader_ids`: Run evaluation task against a data source (e.g. rows, uri, etc.) and a list of grader_ids
+- `data_source` and `grader_ids`: Run evaluation task against a data source (e.g. rows, uri, etc.) and a list of grader_ids. Prefer this early in your evaluation cycle, when you are still iterating on your data and graders.
:param benchmark_id: The benchmark ID to evaluate.
:param dataset_id: The dataset ID to evaluate.
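For context, here is a minimal sketch of constructing an EvaluationTask each of the three documented ways. It assumes the fields above are optional attributes on the Pydantic model; the specific benchmark, dataset, and grader IDs, and the inline-rows shape of data_source, are hypothetical placeholders, not values from this commit.

from llama_stack.apis.evaluation.evaluation import EvaluationTask

# Option 1: a curated benchmark where the graders are already settled.
task = EvaluationTask(benchmark_id="my-benchmark")  # hypothetical ID

# Option 2: a registered dataset plus an explicit list of graders.
task = EvaluationTask(
    dataset_id="my-dataset",     # hypothetical dataset ID
    grader_ids=["exact-match"],  # hypothetical grader ID
)

# Option 3: an inline data source plus graders -- the option this commit
# recommends while you are still iterating on data and graders. The rows
# shape is an assumption based on the "(e.g. rows, uri, etc.)" hint above.
task = EvaluationTask(
    data_source={"type": "rows", "rows": [{"input": "2+2=", "expected": "4"}]},
    grader_ids=["exact-match"],
)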