llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-03 09:53:45 +00:00

History

Anastas Stoyanovsky a3580e6bc0 feat!: Wire through parallel_tool_calls to Responses API (#4124 ) # What does this PR do? Initial PR against #4123 Adds `parallel_tool_calls` spec to Responses API and basic initial implementation where no more than one function call is generated when set to `False`. ## Test Plan * Unit tests have been added to verify no more than one function call is generated. * A followup PR will verify passing through `parallel_tool_calls` to providers. * A followup PR will address verification and/or implementation of incremental function calling across multiple conversational turns. --------- Signed-off-by: Anastas Stoyanovsky <astoyano@redhat.com>		2025-11-18 11:25:08 -08:00
..
cli	fix: rename llama_stack_api dir (#4155 )	2025-11-13 15:04:36 -08:00
core	feat(openapi): switch to fastapi-based generator (#3944 )	2025-11-14 15:53:53 -08:00
distributions	fix: rename llama_stack_api dir (#4155 )	2025-11-13 15:04:36 -08:00
models	refactor: remove dead inference API code and clean up imports (#4093 )	2025-11-10 15:29:24 -08:00
providers	feat!: Wire through parallel_tool_calls to Responses API (#4124 )	2025-11-18 11:25:08 -08:00
testing	fix: MCP authorization parameter implementation (#4052 )	2025-11-14 08:54:42 -08:00
__init__.py	chore: Stack server no longer depends on llama-stack-client (#4094 )	2025-11-07 09:54:09 -08:00
env.py	chore(package): migrate to src/ layout (#3920 )	2025-10-27 12:02:21 -07:00
log.py	chore(package): migrate to src/ layout (#3920 )	2025-10-27 12:02:21 -07:00