mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-16 21:19:27 +00:00

Ken Dreyer 94e977c257 fix(docs): link to test replay-record docs for discoverability (#4134 ) Help users find the comprehensive integration testing docs by linking to the record-replay documentation. This clarifies that the technical README complements the main docs.		2025-11-12 10:04:56 -08:00

Test Recording System

This directory contains recorded inference API responses used for deterministic testing without requiring live API access.

For a broader guide to integration testing, see the record-replay documentation in the main docs. This README complements it with technical details about the recordings themselves.

Structure

responses/ - JSON files containing request/response pairs for inference operations

Each JSON file contains:

To reduce noise in git diffs, the recording system automatically normalizes fields that vary between runs but don't affect test behavior:

These normalizations ensure that re-recording tests produces minimal git diffs, making it easier to review actual changes to test behavior.
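
As a rough illustration of the idea, the sketch below replaces run-dependent fields with fixed values before a recording is written. The field names (`id`, `created`), the `rec-` prefix, and the replacement values are assumptions chosen for illustration. They are not taken from the recording system's actual normalization rules.

```python
import hashlib
import json

# Hypothetical run-dependent fields and their fixed replacements.
# The real recording system's list may differ.
VOLATILE_FIELDS = {"created": 0}


def normalize_response(request: dict, response: dict) -> dict:
    """Return a copy of `response` with run-dependent fields made stable."""
    normalized = dict(response)
    for field, fixed_value in VOLATILE_FIELDS.items():
        if field in normalized:
            normalized[field] = fixed_value
    if "id" in normalized:
        # Derive the id from the request so it stays the same across re-recordings.
        canonical = json.dumps(request, sort_keys=True).encode("utf-8")
        normalized["id"] = "rec-" + hashlib.sha256(canonical).hexdigest()[:12]
    return normalized
```

Two recordings of the same request then serialize identically even if the live API returned different ids or timestamps, so a re-recording shows up in `git diff` only when the substantive response changed.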

Replays responses from existing recordings without calling live APIs:

LLAMA_STACK_TEST_INFERENCE_MODE=replay pytest tests/integration/

Records only when no recording exists; otherwise replays. Use this for iterative development:

LLAMA_STACK_TEST_INFERENCE_MODE=record-if-missing pytest tests/integration/

Force-records all API interactions, overwriting existing recordings. Use with caution:

LLAMA_STACK_TEST_INFERENCE_MODE=record pytest tests/integration/

Skips recordings entirely and uses live APIs:

LLAMA_STACK_TEST_INFERENCE_MODE=live pytest tests/integration/
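
The four modes differ only in when the live API is called and when a recording is written. A minimal sketch of that decision logic follows. It assumes a hypothetical in-memory `recordings` dict keyed by a request hash, a hypothetical `call_live` callable, and a default of `replay` when the variable is unset; none of these are the project's actual API.

```python
import hashlib
import json
import os
from typing import Callable, Optional


def request_key(request: dict) -> str:
    """Stable key for a request (hypothetical hashing scheme)."""
    canonical = json.dumps(request, sort_keys=True).encode("utf-8")
    return hashlib.sha256(canonical).hexdigest()


def handle(
    request: dict,
    recordings: dict,
    call_live: Callable[[dict], dict],
    mode: Optional[str] = None,
) -> dict:
    """Serve `request` according to the inference mode."""
    # Assumption: fall back to "replay" when the variable is not set.
    mode = mode or os.environ.get("LLAMA_STACK_TEST_INFERENCE_MODE", "replay")
    key = request_key(request)

    if mode == "live":
        # Bypass recordings completely.
        return call_live(request)
    if mode == "replay":
        if key not in recordings:
            raise KeyError(f"no recording for request {key[:12]}")
        return recordings[key]
    if mode == "record-if-missing":
        if key not in recordings:
            recordings[key] = call_live(request)
        return recordings[key]
    if mode == "record":
        # Always call the live API and overwrite any existing recording.
        recordings[key] = call_live(request)
        return recordings[key]
    raise ValueError(f"unknown inference mode: {mode!r}")
```

In this sketch, `replay` fails loudly on a missing recording rather than falling back to the network, which keeps test runs deterministic.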

To apply normalization to existing recordings (for example, after updating the normalization logic), run:

python scripts/normalize_recordings.py

Use --dry-run to preview changes without modifying files.
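
A dry run of this kind typically computes the normalized form of each file and reports which files would change, writing to disk only when the flag is absent. The sketch below illustrates that pattern. The `normalize` placeholder, the `created` field, and the function names are hypothetical and are not taken from `scripts/normalize_recordings.py`.

```python
import json
from pathlib import Path
from typing import List


def normalize(data: dict) -> dict:
    """Placeholder normalization (assumption): zero out a `created` field."""
    out = dict(data)
    if "created" in out:
        out["created"] = 0
    return out


def normalize_dir(directory: Path, dry_run: bool = False) -> List[Path]:
    """Normalize each JSON file in `directory`; return the files that change."""
    changed = []
    for path in sorted(directory.glob("*.json")):
        original = json.loads(path.read_text())
        updated = normalize(original)
        if updated != original:
            changed.append(path)
            if not dry_run:
                # Only write when this is not a preview.
                path.write_text(json.dumps(updated, indent=2) + "\n")
    return changed
```

Calling `normalize_dir(path, dry_run=True)` returns the list of files that would be rewritten while leaving them untouched on disk.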