llama-stack-mirror/docs/source/providers/batches/index.md

624 B

Batches

Overview

The Batches API enables efficient processing of multiple requests in a single operation, particularly useful for processing large datasets, batch evaluation workflows, and cost-effective inference at scale.

The API is designed to allow use of openai client libraries for seamless integration.

This API provides the following extensions:

  • idempotent batch creation

Note: This API is currently under active development and may undergo changes.

This section contains documentation for all available providers for the batches API.

Providers

:maxdepth: 1

inline_reference