Merge branch 'pr1573' into api_2

This commit is contained in:
Xi Yan 2025-03-12 23:36:03 -07:00
commit f90dcd2a69
3 changed files with 31 additions and 20 deletions

View file

@ -4716,10 +4716,10 @@ components:
type: string
enum:
- post-training/messages
- eval/question-answer
- eval/messages-answer
title: DatasetPurpose
description: >-
Purpose of the dataset. Each type has a different column format.
Purpose of the dataset. Each purpose has a required input data schema.
source:
$ref: '#/components/schemas/DataSource'
metadata:
@ -6776,20 +6776,21 @@ components:
type: string
enum:
- post-training/messages
- eval/question-answer
- eval/messages-answer
description: >-
The purpose of the dataset. One of - "post-training/messages": The dataset
contains a messages column with list of messages for post-training. -
"eval/question-answer": The dataset contains a question and answer column.
"eval/messages-answer": The dataset contains a messages column with list
of messages and an answer column.
source:
$ref: '#/components/schemas/DataSource'
description: >-
The data source of the dataset. Examples: - { "type": "uri", "uri": "https://mywebsite.com/mydata.jsonl"
} - { "type": "uri", "uri": "lsfs://mydata.jsonl" } - { "type": "huggingface",
"dataset_path": "tatsu-lab/alpaca", "params": { "split": "train" } } -
{ "type": "rows", "rows": [ { "messages": [ {"role": "user", "content":
"Hello, world!"}, {"role": "assistant", "content": "Hello, world!"}, ]
} ] }
"huggingface": { "dataset_path": "tatsu-lab/alpaca", "params": { "split":
"train" } } } - { "type": "rows", "rows": [ { "messages": [ {"role": "user",
"content": "Hello, world!"}, {"role": "assistant", "content": "Hello,
world!"}, ] } ] }
metadata:
type: object
additionalProperties: