llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-06-27 18:50:41 +00:00

Ben Browning dc46725f56 fix: properly handle streaming client disconnects (#2000)	2025-04-23 15:44:28 +02:00

# What does this PR do?

Previously, when a streaming client would disconnect before we were finished streaming the entire response, an error like the below would get raised from the `sse_generator` function in `llama_stack/distribution/server/server.py`:

```
AttributeError: 'coroutine' object has no attribute 'aclose'. Did you mean: 'close'?
```

This was because we were calling `aclose` on a coroutine instead of the awaited value from that coroutine. This change fixes that, so that we save off the awaited value and then can call `aclose` on it if we encounter an `asyncio.CancelledError`, like we see when a client disconnects before we're finished streaming.

The other changes in here are to add a simple set of tests for the happy path of our SSE streaming and this client disconnect path. That unfortunately requires adding one more dependency into our unit test section of pyproject.toml since `server.py` requires loading some of the telemetry code for me to test this functionality.

## Test Plan

I wrote the tests in `tests/unit/server/test_sse.py` first, verified the client disconnected test failed before my change, and that it passed afterwards.

```
python -m pytest -s -v tests/unit/server/test_sse.py
```

Signed-off-by: Ben Browning <bbrownin@redhat.com>
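The failure mode described in this commit message can be reproduced in isolation. The sketch below is not the code from `server.py`; `event_stream`, `make_stream`, `buggy`, and `fixed` are hypothetical names chosen for illustration, and the client disconnect is simulated by raising `asyncio.CancelledError` by hand. It only assumes that a coroutine resolves to an async generator, which is the situation the message describes.

```python
import asyncio
from typing import AsyncIterator, List

# Records generator cleanup so the effect of aclose() is observable.
closed: List[str] = []


async def event_stream() -> AsyncIterator[str]:
    """Hypothetical async generator producing SSE-style chunks."""
    try:
        for i in range(3):
            yield f"data: {i}\n\n"
            await asyncio.sleep(0)
    finally:
        # Runs when the generator finishes or is closed via aclose().
        closed.append("closed")


async def make_stream() -> AsyncIterator[str]:
    """Hypothetical coroutine whose awaited value is an async generator."""
    return event_stream()


async def buggy() -> str:
    """Call aclose() on the coroutine itself, as in the described bug."""
    coro = make_stream()  # not awaited: this is a coroutine object
    try:
        await coro.aclose()  # type: ignore[attr-defined]
    except AttributeError as exc:
        return str(exc)
    finally:
        coro.close()  # avoid a "never awaited" warning
    return ""


async def fixed() -> List[str]:
    """Save the awaited value, then close it when cancelled."""
    received: List[str] = []
    gen = await make_stream()  # the awaited value is the async generator
    try:
        async for chunk in gen:
            received.append(chunk)
            # Simulate a client disconnecting after the first chunk.
            raise asyncio.CancelledError()
    except asyncio.CancelledError:
        await gen.aclose()  # valid: async generators define aclose()
    return received


if __name__ == "__main__":
    print(asyncio.run(buggy()))
    print(asyncio.run(fixed()), closed)
```

In a real server the `CancelledError` would normally be re-raised after cleanup so the framework can finish cancelling the task; this sketch swallows it only to keep the example self-contained.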
apis	feat(agents): add agent naming functionality (#1922)	2025-04-17 07:02:47 -07:00
cli	feat: allow building distro with external providers (#1967)	2025-04-18 17:18:28 +02:00
distribution	fix: properly handle streaming client disconnects (#2000)	2025-04-23 15:44:28 +02:00
models	fix: OAI compat endpoint for meta reference inference provider (#1962)	2025-04-17 11:16:04 -07:00
providers	fix: Added lazy initialization of the remote vLLM client to avoid issues with expired asyncio event loop (#1969)	2025-04-23 15:33:19 +02:00
strong_typing	chore: more mypy checks (ollama, vllm, ...) (#1777)	2025-04-01 17:12:39 +02:00
templates	feat: Update NVIDIA to GA docs; remove notebook reference until ready (#1999)	2025-04-18 19:13:18 -04:00
__init__.py	export LibraryClient	2024-12-13 12:08:00 -08:00
env.py	refactor(test): move tools, evals, datasetio, scoring and post training tests (#1401)	2025-03-04 14:53:47 -08:00
log.py	chore: Remove style tags from log formatter (#1808)	2025-03-27 10:18:21 -04:00
schema_utils.py	fix: dont check protocol compliance for experimental methods	2025-04-12 16:26:32 -07:00