llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-27 23:31:59 +00:00

History

Ben Browning e2960e9e44 fix: inference providers still using tools with `tool_choice="none"` In our OpenAI API verification tests, some providers were still calling tools even when `tool_choice="none"` was passed in the chat completion requests. Because they aren't all respecting `tool_choice` properly, this adjusts our routing implementation to remove the `tools` and `tool_choice` from the request if `tool_choice="none"` is passed in so that it does not attempt to call any of those tools. Adjusting this in the router fixes this across all providers. This also cleans up the non-streaming chat completion responses for tools, ensuring it returns `None` instead of an empty list when there are no tool calls, to exactly match the OpenAI API responses in that case. I observed existing failures in our OpenAI API verification suite - see https://github.com/bbrowning/llama-stack-tests/blob/main/openai-api-verification/2025-04-27.md#together-llama-stack for the failing `test_chat_*_tool_choice_none` tests. All streaming and non-streaming variants were failing across all 3 tested models. After this change, all of those 6 failing tests are now passing with no regression in the other tests. I verified this via: ``` llama stack run --image-type venv \ tests/verifications/openai-api-verification-run.yaml ``` ``` python -m pytest -s -v \ 'tests/verifications/openai_api/test_chat_completion.py' \ --provider=together-llama-stack ``` The entire verification suite is not 100% on together.ai yet, but it's getting closer. This also increased the pass rate for fireworks.ai, and did not regress the groq or openai tests at all. Signed-off-by: Ben Browning <bbrownin@redhat.com>		2025-04-30 13:27:28 -04:00
..
routers	fix: inference providers still using tools with `tool_choice="none"`	2025-04-30 13:27:28 -04:00
server	feat: Add Kubernetes authentication (#1778 )	2025-04-28 22:24:58 +02:00
store	fix: handle registry errors gracefully (#1732 )	2025-03-20 15:24:07 -07:00
ui	fix: tools page on playground resets agent after every interaction (#2044 )	2025-04-28 23:13:27 +02:00
utils	feat: add health to all providers through providers endpoint (#1418 )	2025-04-14 11:59:36 +02:00
__init__.py	API Updates (#73 )	2024-09-17 19:51:35 -07:00
access_control.py	feat: make sure agent sessions are under access control (#1737 )	2025-03-21 07:31:16 -07:00
build.py	fix: enforce stricter ASCII rules lint rules in Ruff (#2062 )	2025-04-30 18:05:27 +02:00
build_conda_env.sh	chore: remove straggler references to llama-models (#1345 )	2025-03-01 14:26:03 -08:00
build_container.sh	feat: include run.yaml in the container image (#2005 )	2025-04-24 11:29:53 +02:00
build_venv.sh	chore: remove straggler references to llama-models (#1345 )	2025-03-01 14:26:03 -08:00
client.py	chore: move all Llama Stack types from llama-models to llama-stack (#1098 )	2025-02-14 09:10:59 -08:00
common.sh	fix: Fixing some small issues with the build scripts (#1132 )	2025-02-19 22:20:49 -08:00
configure.py	feat: add provider API for listing and inspecting provider info (#1429 )	2025-03-13 15:07:21 -07:00
datatypes.py	feat: Add Kubernetes authentication (#1778 )	2025-04-28 22:24:58 +02:00
distribution.py	feat: allow building distro with external providers (#1967 )	2025-04-18 17:18:28 +02:00
inspect.py	feat: add health to all providers through providers endpoint (#1418 )	2025-04-14 11:59:36 +02:00
library_client.py	feat: add health to all providers through providers endpoint (#1418 )	2025-04-14 11:59:36 +02:00
providers.py	feat: add health to all providers through providers endpoint (#1418 )	2025-04-14 11:59:36 +02:00
request_headers.py	feat(server): add attribute based access control for resources (#1703 )	2025-03-19 21:28:52 -07:00
resolver.py	feat: add health to all providers through providers endpoint (#1418 )	2025-04-14 11:59:36 +02:00
stack.py	feat: add health to all providers through providers endpoint (#1418 )	2025-04-14 11:59:36 +02:00
start_stack.sh	docs: Update docs and fix warning in start-stack.sh (#1937 )	2025-04-11 16:26:17 -07:00