llama-stack/llama_stack
Hardik Shah 999195fe5b
fix: [Litellm] Do not swallow first token (#1316)
`ChatCompletionResponseEventType: start` events are ignored and not yielded by the
agent_instance, since we expect them to carry no content.

However, litellm sends its first event as `ChatCompletionResponseEventType:
start` *with* content (the first token), which we were therefore skipping.
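
A minimal sketch of the fix, assuming illustrative event and field names rather than the exact llama-stack and litellm types: instead of unconditionally dropping `start` events, forward any content they carry.

```python
# Hypothetical sketch: the event/field names below are illustrative,
# not the exact llama-stack or litellm types.
from dataclasses import dataclass
from enum import Enum
from typing import Iterator, Optional


class ChatCompletionResponseEventType(Enum):
    start = "start"
    progress = "progress"
    complete = "complete"


@dataclass
class ChatCompletionResponseEvent:
    event_type: ChatCompletionResponseEventType
    delta: Optional[str] = None  # token text carried by this event, if any


def stream_text(events: Iterator[ChatCompletionResponseEvent]) -> Iterator[str]:
    """Yield token text from a stream of chat-completion events."""
    for event in events:
        if event.event_type == ChatCompletionResponseEventType.start:
            # Bug: a bare `continue` here swallowed litellm's first token,
            # since litellm attaches real content to its `start` event.
            # Fix: yield that content, if present, before moving on.
            if event.delta:
                yield event.delta
            continue
        if event.delta:
            yield event.delta
```

The change is exercised by the existing agent test: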

```
LLAMA_STACK_CONFIG=dev pytest -s -v tests/client-sdk/agents/test_agents.py --inference-model "openai/gpt-4o-mini" -k test_agent_simple
```
This test was failing before the fix, since the word "hello" was not in the
final response.
2025-02-27 20:53:47 -08:00
| Name | Last commit | Date |
| --- | --- | --- |
| apis | ci: add mypy for static type checking (#1101) | 2025-02-21 13:15:40 -08:00 |
| cli | fix: Incorrect import path for print_subcommand_description() (#1315) | 2025-02-27 18:50:41 -08:00 |
| distribution | fix(test): update client-sdk tests to handle tool format parametrization better (#1287) | 2025-02-26 21:16:00 -08:00 |
| models/llama | chore: move all Llama Stack types from llama-models to llama-stack (#1098) | 2025-02-14 09:10:59 -08:00 |
| providers | fix: [Litellm] Do not swallow first token (#1316) | 2025-02-27 20:53:47 -08:00 |
| scripts | ci: add mypy for static type checking (#1101) | 2025-02-21 13:15:40 -08:00 |
| strong_typing | Ensure that deprecations for fields follow through to OpenAPI | 2025-02-19 13:54:04 -08:00 |
| templates | docs: update the output of llama-stack-client models list (#1271) | 2025-02-27 16:46:38 -08:00 |
| `__init__.py` | export LibraryClient | 2024-12-13 12:08:00 -08:00 |
| `schema_utils.py` | ci: add mypy for static type checking (#1101) | 2025-02-21 13:15:40 -08:00 |