llama-stack-mirror/src/llama_stack
Anastas Stoyanovsky a3580e6bc0
feat!: Wire through parallel_tool_calls to Responses API (#4124)
# What does this PR do?
Initial PR against #4123
Adds `parallel_tool_calls` spec to Responses API and basic initial
implementation where no more than one function call is generated when
set to `False`.

## Test Plan
* Unit tests have been added to verify no more than one function call is
generated.
* A followup PR will verify passing through `parallel_tool_calls` to
providers.
* A followup PR will address verification and/or implementation of
incremental function calling across multiple conversational turns.

---------

Signed-off-by: Anastas Stoyanovsky <astoyano@redhat.com>
2025-11-18 11:25:08 -08:00
..
cli fix: rename llama_stack_api dir (#4155) 2025-11-13 15:04:36 -08:00
core feat(openapi): switch to fastapi-based generator (#3944) 2025-11-14 15:53:53 -08:00
distributions fix: rename llama_stack_api dir (#4155) 2025-11-13 15:04:36 -08:00
models refactor: remove dead inference API code and clean up imports (#4093) 2025-11-10 15:29:24 -08:00
providers feat!: Wire through parallel_tool_calls to Responses API (#4124) 2025-11-18 11:25:08 -08:00
testing fix: MCP authorization parameter implementation (#4052) 2025-11-14 08:54:42 -08:00
__init__.py chore: Stack server no longer depends on llama-stack-client (#4094) 2025-11-07 09:54:09 -08:00
env.py chore(package): migrate to src/ layout (#3920) 2025-10-27 12:02:21 -07:00
log.py chore(package): migrate to src/ layout (#3920) 2025-10-27 12:02:21 -07:00