llama-stack

forked from phoenix-oss/llama-stack-mirror

History

Ben Browning 0b6cd45950 fix: Additional streaming error handling (#2007 ) # What does this PR do? This expands the `test_sse` test suite and fixes some edge cases with bugs in our SSE error handling to ensure streaming clients always get a proper error response. First, we handle the case where a client disconnects before we actually start streaming the response back. Previously we only handled the case where a client disconnected as we were streaming the response, but there was an edge case where a client disconnecting before we streamed any response back did not trigger our logic to cleanly handle that disconnect. Second, we handle the case where an error is thrown from the server before the actual async generator gets created from the provider. This happens in scenarios like the newly merged OpenAI API input validation, where we eagerly raise validation errors before returning the async generator object that streams the responses back. ## Test Plan Tested via: ``` python -m pytest -s -v tests/unit/server/test_sse.py ``` Both test cases failed before, and passed afterwards. The test cases were written based on me experimenting with actual clients that would do bad things like randomly disconnect or send invalid input in streaming mode and I hit these two cases, where things were misbehaving in our error handling. Signed-off-by: Ben Browning <bbrownin@redhat.com>		2025-04-24 17:01:45 -07:00
..
client-sdk/post_training	feat: Add nemo customizer (#1448 )	2025-03-25 11:01:10 -07:00
external-provider/llama-stack-provider-ollama	feat: allow building distro with external providers (#1967 )	2025-04-18 17:18:28 +02:00
integration	feat(agents): add agent naming functionality (#1922 )	2025-04-17 07:02:47 -07:00
unit	fix: Additional streaming error handling (#2007 )	2025-04-24 17:01:45 -07:00
verifications	fix: Return HTTP 400 for OpenAI API validation errors (#2002 )	2025-04-23 17:48:32 +02:00
__init__.py	refactor(test): introduce --stack-config and simplify options (#1404 )	2025-03-05 17:02:02 -08:00