llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-28 01:40:24 +00:00

History

Ben Browning fd9d52564b fix: resolve BuiltinTools to strings for vllm tool_call messages When the result of a ToolCall gets passed back into vLLM for the model to handle the tool call result (as is often the case in agentic tool-calling workflows), we forgot to handle the case where BuiltinTool calls are not string values but instead instances of the BuiltinTool enum. This fixes that, properly converting those enums to string values before trying to serialize them into an OpenAI chat completion request to vLLM. PR #1931 fixed a bug where we weren't passing these tool calling results back into vLLM, but as a side-effect it created this serialization bug when using BuiltinTools. Resolves #2070 Signed-off-by: Ben Browning <bbrownin@redhat.com>		2025-04-30 20:10:33 -04:00
..
__init__.py	refactor: move all llama code to models/llama out of meta reference (#1887 )	2025-04-07 15:03:58 -07:00
embedding_mixin.py	fix: dont assume SentenceTransformer is imported	2025-02-25 16:53:01 -08:00
litellm_openai_mixin.py	feat: add api.llama provider, llama-guard-4 model (#2058 )	2025-04-29 10:07:41 -07:00
model_registry.py	test: verification on provider's OAI endpoints (#1893 )	2025-04-07 23:06:28 -07:00
openai_compat.py	fix: resolve BuiltinTools to strings for vllm tool_call messages	2025-04-30 20:10:33 -04:00
prompt_adapter.py	feat: new system prompt for llama4 (#2031 )	2025-04-25 11:29:08 -07:00