llama-stack/tests/unit/providers/inference
Ilya Kolchinsky 43d4447ff0
fix: remote vLLM tool execution now works when the last chunk contains the call arguments (#2112)
# What does this PR do?
Closes #2111.
Fixes a bug that caused Llama Stack to return a bare `<tool_call>` and
complete the turn without executing the tool when the last streamed
chunk contained the call arguments. See the issue description for more
detail.
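
To illustrate the edge case named in the title, here is a minimal sketch of accumulating a streamed tool call. It is not the Llama Stack or vLLM code: `Chunk`, its fields, and `collect_tool_call` are hypothetical names invented for this example. The assumption is that the final chunk can carry both the argument payload and the finish reason, so the payload has to be consumed before the stream is treated as finished.

```python
from dataclasses import dataclass
from typing import Iterable, List, Optional, Tuple


@dataclass
class Chunk:
    """Hypothetical stand-in for one streamed delta (not a real Llama Stack type)."""
    tool_name: Optional[str] = None
    arguments_delta: str = ""
    finish_reason: Optional[str] = None


def collect_tool_call(chunks: Iterable[Chunk]) -> Optional[Tuple[str, str]]:
    """Return (tool_name, arguments_json) if the stream contained a tool call."""
    name: Optional[str] = None
    parts: List[str] = []
    for chunk in chunks:
        if chunk.tool_name:
            name = chunk.tool_name
        # Consume the payload first, even when this is the final chunk.
        if chunk.arguments_delta:
            parts.append(chunk.arguments_delta)
        # Only after that may the stream be treated as finished.
        if chunk.finish_reason is not None:
            break
    if name is None:
        return None
    return name, "".join(parts)


# The last chunk carries both the arguments and the finish reason.
stream = [
    Chunk(tool_name="get_weather"),
    Chunk(arguments_delta='{"city": "Paris"}', finish_reason="tool_calls"),
]
result = collect_tool_call(stream)
```

A loop that checked `finish_reason` before reading `arguments_delta` would drop the arguments in this stream and leave a tool call with an empty payload, which is one way a turn could end without the tool running.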

## Test Plan
1) Ran existing unit tests
2) Added a dedicated test verifying correct behavior in this edge case
3) Ran the code snapshot from #2111
2025-05-14 11:38:00 +02:00
test_remote_vllm.py 2025-05-14 11:38:00 +02:00