llama-stack-mirror/llama_stack/providers/utils/inference
Ben Browning fd9d52564b fix: resolve BuiltinTools to strings for vllm tool_call messages
When the result of a ToolCall gets passed back into vLLM for the model
to handle the tool call result (as is often the case in agentic
tool-calling workflows), we forgot to handle the case where
BuiltinTool calls are not string values but instead instances of the
BuiltinTool enum. This fixes that, properly converting those enums to
string values before trying to serialize them into an OpenAI chat
completion request to vLLM.

PR #1931 fixed a bug where we weren't passing these tool calling
results back into vLLM, but as a side-effect it created this
serialization bug when using BuiltinTools.

Resolves #2070

Signed-off-by: Ben Browning <bbrownin@redhat.com>
2025-04-30 20:10:33 -04:00
..
__init__.py refactor: move all llama code to models/llama out of meta reference (#1887) 2025-04-07 15:03:58 -07:00
embedding_mixin.py fix: dont assume SentenceTransformer is imported 2025-02-25 16:53:01 -08:00
litellm_openai_mixin.py feat: add api.llama provider, llama-guard-4 model (#2058) 2025-04-29 10:07:41 -07:00
model_registry.py test: verification on provider's OAI endpoints (#1893) 2025-04-07 23:06:28 -07:00
openai_compat.py fix: resolve BuiltinTools to strings for vllm tool_call messages 2025-04-30 20:10:33 -04:00
prompt_adapter.py feat: new system prompt for llama4 (#2031) 2025-04-25 11:29:08 -07:00