llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-03 18:00:36 +00:00

History

Yuan Tang 5966079770 fix: More robust handling of the arguments in tool call response in remote::vllm (#1169 ) # What does this PR do? This fixes the following issue on the server side when the tool call response contains empty args. This happens when running `examples.agents.e2e_loop_with_client_tools` but `get_ticker_data` returns `[]`: ``` Traceback (most recent call last): File "/home/yutang/repos/llama-stack/llama_stack/distribution/server/server.py", line 208, in sse_generator async for item in event_gen: File "/home/yutang/repos/llama-stack/llama_stack/providers/inline/agents/meta_reference/agents.py", line 169, in _create_agent_turn_streaming async for event in agent.create_and_execute_turn(request): File "/home/yutang/repos/llama-stack/llama_stack/providers/inline/agents/meta_reference/agent_instance.py", line 189, in create_and_execute_turn async for chunk in self.run( File "/home/yutang/repos/llama-stack/llama_stack/providers/inline/agents/meta_reference/agent_instance.py", line 258, in run async for res in self._run( File "/home/yutang/repos/llama-stack/llama_stack/providers/inline/agents/meta_reference/agent_instance.py", line 499, in _run async for chunk in await self.inference_api.chat_completion( File "/home/yutang/repos/llama-stack/llama_stack/distribution/routers/routers.py", line 182, in <genexpr> return (chunk async for chunk in await provider.chat_completion(**params)) File "/home/yutang/repos/llama-stack/llama_stack/providers/remote/inference/vllm/vllm.py", line 296, in _stream_chat_completion async for chunk in res: File "/home/yutang/repos/llama-stack/llama_stack/providers/remote/inference/vllm/vllm.py", line 162, in _process_vllm_chat_completion_stream_response arguments=json.loads(tool_call_buf.arguments), File "/home/yutang/.conda/envs/distribution-myenv/lib/python3.10/json/__init__.py", line 346, in loads return _default_decoder.decode(s) File "/home/yutang/.conda/envs/distribution-myenv/lib/python3.10/json/decoder.py", line 337, in decode obj, end = self.raw_decode(s, idx=_w(s, 0).end()) File "/home/yutang/.conda/envs/distribution-myenv/lib/python3.10/json/decoder.py", line 355, in raw_decode raise JSONDecodeError("Expecting value", s, err.value) from None json.decoder.JSONDecodeError: Expecting value: line 1 column 1 (char 0) ``` ## Test Plan All existing tests in `tests/client-sdk/inference/test_text_inference.py` passed. [//]: # (## Documentation) --------- Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>		2025-02-19 22:27:02 -08:00
..
bedrock	chore: remove llama_models.llama3.api imports from providers (#1107 )	2025-02-19 19:01:29 -08:00
cerebras	chore: remove llama_models.llama3.api imports from providers (#1107 )	2025-02-19 19:01:29 -08:00
databricks	chore: remove llama_models.llama3.api imports from providers (#1107 )	2025-02-19 19:01:29 -08:00
fireworks	chore: remove llama_models.llama3.api imports from providers (#1107 )	2025-02-19 19:01:29 -08:00
groq	chore: move all Llama Stack types from llama-models to llama-stack (#1098 )	2025-02-14 09:10:59 -08:00
nvidia	fix: Get distro_codegen.py working with default deps and enabled in pre-commit hooks (#1123 )	2025-02-19 18:39:20 -08:00
ollama	chore: remove llama_models.llama3.api imports from providers (#1107 )	2025-02-19 19:01:29 -08:00
passthrough	feat: inference passthrough provider (#1166 )	2025-02-19 21:47:00 -08:00
runpod	chore: remove llama_models.llama3.api imports from providers (#1107 )	2025-02-19 19:01:29 -08:00
sambanova	chore: remove llama_models.llama3.api imports from providers (#1107 )	2025-02-19 19:01:29 -08:00
sample	build: format codebase imports using ruff linter (#1028 )	2025-02-13 10:06:21 -08:00
tgi	chore: remove llama_models.llama3.api imports from providers (#1107 )	2025-02-19 19:01:29 -08:00
together	chore: remove llama_models.llama3.api imports from providers (#1107 )	2025-02-19 19:01:29 -08:00
vllm	fix: More robust handling of the arguments in tool call response in remote::vllm (#1169 )	2025-02-19 22:27:02 -08:00
__init__.py	`impls` -> `inline`, `adapters` -> `remote` (#381 )	2024-11-06 14:54:05 -08:00