phoenix-oss/llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-10-04 12:07:34 +00:00

Matthew Farrellee cffc4edf47

feat: Add optional idempotency support to batches API (#3171)

Implements optional idempotency for batch creation using `idem_tok`
parameter:

* **Core idempotency**: Same token + parameters returns existing batch
* **Conflict detection**: Same token + different parameters raises HTTP
409 ConflictError
* **Metadata order independence**: Different key ordering doesn't affect
idempotency

**API changes:**
- Add optional `idem_tok` parameter to `create_batch()` method
- Enhanced API documentation with idempotency extensions

**Implementation:**
- Reference provider supports idempotent batch creation
- ConflictError for proper HTTP 409 status code mapping
- Comprehensive parameter validation

**Testing:**
- Unit tests: focused tests covering core scenarios with parametrized
conflict detection
- Integration tests: tests validating real OpenAI client behavior

This enables client-side retry safety and prevents duplicate batch
creation when using the same idempotency token, following REST API

closes #3144

2025-08-22 15:50:40 -07:00

Batches

Overview

The Batches API enables efficient processing of multiple requests in a single operation, particularly useful for processing large datasets, batch evaluation workflows, and cost-effective inference at scale.

The API is designed to be compatible with the OpenAI client libraries, so existing client code can be used against it directly.

This API provides the following extensions:
 - idempotent batch creation
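
As a rough illustration of what idempotent batch creation means in practice, the following is a minimal in-memory sketch, not the reference provider's implementation. The `InMemoryBatchStore` class, its `_fingerprint` helper, and the local `ConflictError` are hypothetical names introduced only for this example; the `idem_tok` parameter name and the behaviors shown (same token and parameters returns the existing batch, same token with different parameters is a conflict, metadata key order is irrelevant) come from the commit description above.

```python
import hashlib
import json
import uuid


class ConflictError(Exception):
    """Hypothetical stand-in for an error a server would map to HTTP 409."""


class InMemoryBatchStore:
    """Toy store illustrating idempotent creation; not the real provider."""

    def __init__(self):
        # Maps idem_tok -> (parameter fingerprint, stored batch dict).
        self._by_token = {}

    @staticmethod
    def _fingerprint(params):
        # sort_keys=True makes the hash independent of dict key ordering,
        # including the ordering of keys inside the nested metadata dict.
        canonical = json.dumps(params, sort_keys=True, separators=(",", ":"))
        return hashlib.sha256(canonical.encode("utf-8")).hexdigest()

    @staticmethod
    def _new_batch(params):
        return {"id": f"batch_{uuid.uuid4().hex}", **params}

    def create_batch(self, input_file_id, endpoint, completion_window,
                     metadata=None, idem_tok=None):
        params = {
            "input_file_id": input_file_id,
            "endpoint": endpoint,
            "completion_window": completion_window,
            "metadata": metadata or {},
        }
        if idem_tok is None:
            # Without a token every call creates a fresh batch.
            return self._new_batch(params)

        fingerprint = self._fingerprint(params)
        existing = self._by_token.get(idem_tok)
        if existing is not None:
            stored_fingerprint, batch = existing
            if stored_fingerprint != fingerprint:
                raise ConflictError(
                    f"idem_tok {idem_tok!r} was already used with different parameters"
                )
            # Same token and same parameters: return the existing batch.
            return batch

        batch = self._new_batch(params)
        self._by_token[idem_tok] = (fingerprint, batch)
        return batch
```

With a store like this, a client that retries a timed-out request with the same `idem_tok` gets back the batch created by the first attempt instead of a duplicate, while accidentally reusing a token for a different request fails loudly.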

Note: This API is currently under active development and may undergo changes.

This section contains documentation for all available providers for the batches API.

Providers

 - inline_reference
