llama-stack-mirror/tests
Ben Browning 45ef6eac10 fix: properly handle streaming client disconnects
Previously, when a streaming client disconnected before we had
finished streaming the entire response, an error like the one below
was raised from the `sse_generator` function in
`llama_stack/distribution/server/server.py`:

```
AttributeError: 'coroutine' object has no attribute 'aclose'. Did you mean: 'close'?
```

This happened because we were calling `aclose` on a coroutine object
instead of on the value obtained by awaiting that coroutine. This change
saves the awaited value so that we can call `aclose` on it when we
encounter an `asyncio.CancelledError`, which is what we see when a
client disconnects before we have finished streaming.
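
To illustrate the difference, here is a minimal, self-contained sketch. It is not the actual `sse_generator` code: `open_stream`, `buggy_close`, `fixed_sse`, and the `closed` list are hypothetical names made up for this example. It assumes an endpoint that is a coroutine which returns an async generator once awaited.

```python
import asyncio

closed = []  # hypothetical marker: records that the generator's cleanup ran


async def open_stream():
    # Hypothetical endpoint: a coroutine that returns an async generator
    # only after it has been awaited.
    async def events():
        try:
            for i in range(3):
                yield f"data: {i}\n\n"
        finally:
            closed.append(True)

    return events()


async def buggy_close():
    coro = open_stream()  # not awaited, so this is a coroutine object
    try:
        await coro.aclose()  # coroutines have close(), not aclose()
    except AttributeError as exc:
        coro.close()  # avoid a "coroutine was never awaited" warning
        return str(exc)


async def fixed_sse(chunks):
    event_gen = None
    try:
        event_gen = await open_stream()  # keep the awaited async generator
        async for item in event_gen:
            chunks.append(item)
            # Simulate the client disconnecting after the first chunk.
            raise asyncio.CancelledError
    except asyncio.CancelledError:
        if event_gen is not None:
            await event_gen.aclose()  # runs the generator's finally block
```

Running `asyncio.run(buggy_close())` returns the `AttributeError` message, while `asyncio.run(fixed_sse([]))` collects one chunk and then closes the generator cleanly.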

The other changes in here are to add a simple set of tests for the
happy path of our SSE streaming and this client disconnect path.
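
The disconnect path can be exercised without a real HTTP client by cancelling the task that consumes the stream. The sketch below is a rough illustration under that assumption, not the tests added in this commit. `ticks`, `consume`, and `disconnect_scenario` are hypothetical names, and the short `asyncio.sleep` stands in for writing a chunk to the client.

```python
import asyncio


async def ticks(state):
    # Hypothetical endless event source; records when it gets closed.
    try:
        while True:
            yield "data: tick\n\n"
    finally:
        state["closed"] = True


async def consume(state, received):
    gen = ticks(state)
    try:
        async for chunk in gen:
            received.append(chunk)
            await asyncio.sleep(0.01)  # stand-in for sending to the client
    except asyncio.CancelledError:
        await gen.aclose()  # close the suspended generator on disconnect
        raise


async def disconnect_scenario():
    state, received = {"closed": False}, []
    task = asyncio.create_task(consume(state, received))
    await asyncio.sleep(0.03)  # let at least one chunk through
    task.cancel()  # simulate the client going away mid-stream
    try:
        await task
    except asyncio.CancelledError:
        pass
    return state["closed"], len(received)
```

A test can then assert that `disconnect_scenario()` reports the generator as closed and that at least one chunk was received before the cancellation.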

That unfortunately requires adding one more dependency to the unit
test section of pyproject.toml, because importing `server.py` loads
some of the telemetry code, which the tests need in order to run.

Signed-off-by: Ben Browning <bbrownin@redhat.com>
2025-04-21 16:10:45 -04:00

| Name | Last commit | Date |
| --- | --- | --- |
| client-sdk/post_training | feat: Add nemo customizer (#1448) | 2025-03-25 11:01:10 -07:00 |
| external-provider/llama-stack-provider-ollama | feat: allow building distro with external providers (#1967) | 2025-04-18 17:18:28 +02:00 |
| integration | feat(agents): add agent naming functionality (#1922) | 2025-04-17 07:02:47 -07:00 |
| unit | fix: properly handle streaming client disconnects | 2025-04-21 16:10:45 -04:00 |
| verifications | test: add multi_image test (#1972) | 2025-04-17 12:51:42 -07:00 |
| __init__.py | refactor(test): introduce --stack-config and simplify options (#1404) | 2025-03-05 17:02:02 -08:00 |