llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-10-04 12:07:34 +00:00

History

Hardik Shah 999195fe5b fix: [Litellm]Do not swallow first token (#1316 ) `ChatCompletionResponseEventType: start` is ignored and not yielded in the agent_instance as we expect that to not have any content. However, litellm sends first event as `ChatCompletionResponseEventType: start` with content ( which was the first token that we were skipping ) ``` LLAMA_STACK_CONFIG=dev pytest -s -v tests/client-sdk/agents/test_agents.py --inference-model "openai/gpt-4o-mini" -k test_agent_simple ``` This was failing before ( since the word hello was not in the final response )		2025-02-27 20:53:47 -08:00
..
bedrock	Fix precommit check after moving to ruff (#927 )	2025-02-02 06:46:45 -08:00
common	build: format codebase imports using ruff linter (#1028 )	2025-02-13 10:06:21 -08:00
datasetio	build: format codebase imports using ruff linter (#1028 )	2025-02-13 10:06:21 -08:00
inference	fix: [Litellm]Do not swallow first token (#1316 )	2025-02-27 20:53:47 -08:00
kvstore	precommit	2025-02-19 22:37:41 -08:00
memory	build: format codebase imports using ruff linter (#1028 )	2025-02-13 10:06:21 -08:00
scoring	Fix precommit check after moving to ruff (#927 )	2025-02-02 06:46:45 -08:00
telemetry	chore: move all Llama Stack types from llama-models to llama-stack (#1098 )	2025-02-14 09:10:59 -08:00
__init__.py	API Updates (#73 )	2024-09-17 19:51:35 -07:00