llama-stack

forked from phoenix-oss/llama-stack-mirror

Ilya Kolchinsky 43d4447ff0

fix: remote vLLM tool execution now works when the last chunk contains the call arguments (#2112)

# What does this PR do?
Closes #2111.
Fixes an error causing Llama Stack to just return `<tool_call>` and
complete the turn without actually executing the tool. See the issue
description for more detail.

## Test Plan
1) Ran existing unit tests
2) Added a dedicated test verifying correct behavior in this edge case
3) Ran the code snapshot from #2111

2025-05-14 11:38:00 +02:00
test_remote_vllm.py	fix: remote vLLM tool execution now works when the last chunk contains the call arguments (#2112)	2025-05-14 11:38:00 +02:00
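
The commit message describes the edge case only in words: the final streamed chunk carries the tool call arguments. The sketch below illustrates that situation under stated assumptions. `Chunk`, `collect_tool_call`, and the field names are hypothetical stand-ins invented for this example. They are not Llama Stack or vLLM APIs, and this is not the code changed in #2112.

```python
import json
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass
class Chunk:
    """Hypothetical stand-in for one streamed chunk (not a real vLLM type)."""
    tool_name: Optional[str] = None
    arguments_delta: str = ""
    finish_reason: Optional[str] = None


def collect_tool_call(chunks: Iterable[Chunk]) -> Optional[dict]:
    """Accumulate a tool call from a stream of chunks.

    The argument fragment is appended before finish_reason is checked,
    so arguments that arrive in the last chunk are not dropped.
    """
    name = None
    buffer = ""
    for chunk in chunks:
        if chunk.tool_name:
            name = chunk.tool_name
        buffer += chunk.arguments_delta
        if chunk.finish_reason is not None:
            break
    if name is None:
        return None  # the stream contained no tool call
    return {"name": name, "arguments": json.loads(buffer) if buffer else {}}


# Edge case: the last chunk holds both the arguments and the finish reason.
stream = [
    Chunk(tool_name="get_weather"),
    Chunk(arguments_delta='{"city": "Paris"}', finish_reason="tool_calls"),
]
call = collect_tool_call(stream)
print(call)
```

A handler that checked `finish_reason` first and stopped there would return the call with empty arguments, which is one plausible way for a turn to end without the tool being executed.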
