History

Ilya Kolchinsky 5052c3cbf3 fix: Fixed an "out of token budget" error when attempting a tool call via remote vLLM provider (#2114 ) # What does this PR do? Closes #2113. Closes #1783. Fixes a bug in handling the end of tool execution request stream where no `finish_reason` is provided by the model. ## Test Plan 1. Ran existing unit tests 2. Added a dedicated test verifying correct behavior in this edge case 3. Ran the code snapshot from #2113 [//]: # (## Documentation)		2025-05-14 13:11:02 -07:00
..
client-sdk/post_training	feat: Add nemo customizer (#1448 )	2025-03-25 11:01:10 -07:00
external-provider/llama-stack-provider-ollama	chore: remove last instances of code-interpreter provider (#2143 )	2025-05-12 10:54:43 -07:00
integration	chore: remove pytest reports (#2156 )	2025-05-13 22:40:15 -07:00
unit	fix: Fixed an "out of token budget" error when attempting a tool call via remote vLLM provider (#2114 )	2025-05-14 13:11:02 -07:00
verifications	fix: Make search tool talk about models (#2151 )	2025-05-13 22:41:51 -07:00
__init__.py	refactor(test): introduce --stack-config and simplify options (#1404 )	2025-03-05 17:02:02 -08:00
README.md	docs: revamp testing documentation (#2155 )	2025-05-13 11:28:29 -07:00

Llama Stack Tests

Llama Stack has multiple layers of testing done to ensure continuous functionality and prevent regressions to the codebase.