llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-03 09:53:45 +00:00

History

Derek Higgins 1562277cfd ci: test adjustments for Qwen3-0.6B (#3978 ) Without this hint Qwen3-0.6B tends to reply with the full name and sometimes doesn't reply with the correct drafted year. --------- Signed-off-by: Derek Higgins <derekh@redhat.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: Ashwin Bharambe <ashwin.bharambe@gmail.com>		2025-11-03 12:19:35 -08:00
..
recordings	ci: test adjustments for Qwen3-0.6B (#3978 )	2025-11-03 12:19:35 -08:00
__init__.py	feat: add batches API with OpenAI compatibility (with inference replay) (#3162 )	2025-08-15 15:34:15 -07:00
conftest.py	fix(perf): make batches tests finish 30x faster (#3834 )	2025-10-17 09:16:44 +02:00
test_batches.py	feat: Add /v1/embeddings endpoint to batches API (#3384 )	2025-10-10 13:25:58 -07:00
test_batches_errors.py	feat: add batches API with OpenAI compatibility (with inference replay) (#3162 )	2025-08-15 15:34:15 -07:00
test_batches_idempotency.py	feat: Add optional idempotency support to batches API (#3171 )	2025-08-22 15:50:40 -07:00