llama-stack-mirror/tests/integration/batches
Derek Higgins 1562277cfd
ci: test adjustments for Qwen3-0.6B (#3978)
Without this hint Qwen3-0.6B tends to reply with the full name
and sometimes doesn't reply with the correct drafted year.

---------

Signed-off-by: Derek Higgins <derekh@redhat.com>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Ashwin Bharambe <ashwin.bharambe@gmail.com>
2025-11-03 12:19:35 -08:00
..
recordings ci: test adjustments for Qwen3-0.6B (#3978) 2025-11-03 12:19:35 -08:00
__init__.py feat: add batches API with OpenAI compatibility (with inference replay) (#3162) 2025-08-15 15:34:15 -07:00
conftest.py fix(perf): make batches tests finish 30x faster (#3834) 2025-10-17 09:16:44 +02:00
test_batches.py feat: Add /v1/embeddings endpoint to batches API (#3384) 2025-10-10 13:25:58 -07:00
test_batches_errors.py feat: add batches API with OpenAI compatibility (with inference replay) (#3162) 2025-08-15 15:34:15 -07:00
test_batches_idempotency.py feat: Add optional idempotency support to batches API (#3171) 2025-08-22 15:50:40 -07:00