llama-stack/llama_stack/providers/utils
ehhuang 0b695538af
fix: chat completion with more than one choice (#2288)
# What does this PR do?
Fix a bug in openai_compat where choices are not indexed correctly.
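
The exact defect is not shown here, so the following is only a minimal sketch of the kind of indexing problem described. It assumes the bug is of the common "every choice gets index 0" variety. `Choice` and `build_choices` are hypothetical names for illustration and are not taken from `openai_compat`.

```python
from dataclasses import dataclass


@dataclass
class Choice:
    """Hypothetical stand-in for one chat completion choice."""

    index: int
    content: str
    finish_reason: str = "stop"


def build_choices(contents: list[str]) -> list[Choice]:
    """Give each generated candidate its own position as its index."""
    # Assumed failure mode: hard-coding index=0 for every choice, which goes
    # unnoticed as long as only a single choice is ever returned.
    return [Choice(index=i, content=text) for i, text in enumerate(contents)]


choices = build_choices(["first answer", "second answer", "third answer"])
indices = [c.index for c in choices]
```

With more than one choice, each entry then carries a distinct index, so callers can match a choice back to its position in the response.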

## Test Plan
Added a new test.

Rerun the failed inference_store tests:
llama stack run fireworks --image-type conda
pytest -s -v tests/integration/ --stack-config http://localhost:8321 -k 'test_inference_store' --text-model meta-llama/Llama-3.3-70B-Instruct --count 10
2025-05-27 15:39:15 -07:00
| Name | Last commit | Date |
| --- | --- | --- |
| bedrock | chore: enable pyupgrade fixes (#1806) | 2025-05-01 14:23:50 -07:00 |
| common | chore: enable pyupgrade fixes (#1806) | 2025-05-01 14:23:50 -07:00 |
| datasetio | chore(refact): move paginate_records fn outside of datasetio (#2137) | 2025-05-12 10:56:14 -07:00 |
| inference | fix: chat completion with more than one choice (#2288) | 2025-05-27 15:39:15 -07:00 |
| kvstore | feat: implementation for agent/session list and describe (#1606) | 2025-05-07 14:49:23 +02:00 |
| memory | feat(sqlite-vec): enable keyword search for sqlite-vec (#1439) | 2025-05-21 15:24:24 -04:00 |
| responses | feat: add responses input items api (#2239) | 2025-05-24 07:05:53 -07:00 |
| scoring | chore: enable pyupgrade fixes (#1806) | 2025-05-01 14:23:50 -07:00 |
| sqlstore | feat: implement get chat completions APIs (#2200) | 2025-05-21 22:21:52 -07:00 |
| telemetry | feat: Propagate W3C trace context headers from clients (#2153) | 2025-05-19 18:56:54 -07:00 |
| tools | fix: match mcp headers in provider data to Responses API shape (#2263) | 2025-05-25 14:33:10 -07:00 |
| \_\_init\_\_.py | API Updates (#73) | 2024-09-17 19:51:35 -07:00 |
| pagination.py | chore(refact): move paginate_records fn outside of datasetio (#2137) | 2025-05-12 10:56:14 -07:00 |
| scheduler.py | chore: enable pyupgrade fixes (#1806) | 2025-05-01 14:23:50 -07:00 |