llama-stack-mirror/llama_stack/apis/datasets
Xi Yan b50bffd15c datasets api
## Summary
- Clean up datasets API to work with uri & files.

- Example client usage
```python
client.datasets.create(
    uri="file://mydata.jsonl",
    schema="jsonl_messages",
    # optional
    dataset_id="my_first_train_data"
)

# ecosystem enablement ( get huggingface dataset easily )
client.datasets.create(
    uri="huggingface://tatsu-lab/alpaca",
    uri_params={"split": "train"},
    schema="jsonl_messages",
    metadata={"label": "v/finetuning/test"},
)

# [ in the future with stronger story for authentication ]
client.datasets.create(
    uri="s3://bucket/mydata.jsonl", <-- signed s3 url
    uri_parms={...},
    schema="jsonl_messages",
)
```


## Test Plan:
- Only API spec updates, implementation will be updated in follow PRs.
2025-03-11 14:54:04 -07:00
..
__init__.py [Evals API] [1/n] Initial API (#287) 2024-10-22 09:31:19 -07:00
datasets.py datasets api 2025-03-11 14:54:04 -07:00