llama-stack-mirror/llama_stack
Ben Browning 544a804678 fix: Together provider shutdown and default to non-streaming
The together inference provider was throwing a stack trace every time
it shut down, as it was trying to call a non-existent `close` method
on the AsyncTogether client. While fixing that, I also adjusted its
shutdown logic to close the OpenAI client if we've created one of
those, as that client does have a `close` method.
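
The shutdown change described above can be sketched roughly as follows. This is a minimal illustration, not the provider's actual code: `FakeTogetherClient`, `FakeOpenAIClient`, `ProviderSketch`, and `get_openai_client` are hypothetical stand-ins, and the only facts taken from this commit are that one client has no `close` method while the other does.

```python
import asyncio
from typing import Optional


class FakeTogetherClient:
    """Hypothetical stand-in for a client that exposes no close() method."""


class FakeOpenAIClient:
    """Hypothetical stand-in for a client that does expose an async close()."""

    def __init__(self) -> None:
        self.closed = False

    async def close(self) -> None:
        self.closed = True


class ProviderSketch:
    """Hypothetical provider holding one eager client and one lazy client."""

    def __init__(self) -> None:
        self._client: Optional[FakeTogetherClient] = FakeTogetherClient()
        self._openai_client: Optional[FakeOpenAIClient] = None

    def get_openai_client(self) -> FakeOpenAIClient:
        # Create the second client lazily, only when it is first needed.
        if self._openai_client is None:
            self._openai_client = FakeOpenAIClient()
        return self._openai_client

    async def shutdown(self) -> None:
        # Drop the reference rather than calling a close() that does not exist.
        self._client = None
        # Close the second client only if one was actually created.
        if self._openai_client is not None:
            await self._openai_client.close()
            self._openai_client = None
```

With this shape, shutting down a provider that never created the second client is a no-op for that client, and shutting down one that did create it closes it exactly once.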

In testing that, I also realized we were defaulting to treating all
requests as streaming requests instead of defaulting to
non-streaming. So, this flips that default to non-streaming to match
how the other providers work.
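
As a rough sketch of the default flip, assuming the request carries an optional `stream` flag: the helper name `resolve_stream` is hypothetical and only illustrates treating an unset flag as non-streaming.

```python
from typing import Optional


def resolve_stream(stream: Optional[bool] = None) -> bool:
    """Hypothetical helper: an unset (None) stream flag means non-streaming."""
    # Only an explicit True selects streaming; None and False both do not.
    return bool(stream)
```

A caller that omits the flag gets a regular, non-streaming response, while passing `stream=True` still opts in to streaming.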

I tested this by ensuring the together inference provider no longer
spits out a long stack trace when shutting it down and by running the
OpenAI API chat completion verification suite to ensure the change in
default streaming logic didn't mess anything else up.

Signed-off-by: Ben Browning <bbrownin@redhat.com>
2025-04-21 17:06:44 -04:00
apis feat(agents): add agent naming functionality (#1922) 2025-04-17 07:02:47 -07:00
cli feat: allow building distro with external providers (#1967) 2025-04-18 17:18:28 +02:00
distribution fix: update ref to raw_errors due to new version of pydantic (#1995) 2025-04-21 11:50:12 -07:00
models fix: OAI compat endpoint for meta reference inference provider (#1962) 2025-04-17 11:16:04 -07:00
providers fix: Together provider shutdown and default to non-streaming 2025-04-21 17:06:44 -04:00
strong_typing chore: more mypy checks (ollama, vllm, ...) (#1777) 2025-04-01 17:12:39 +02:00
templates feat: Update NVIDIA to GA docs; remove notebook reference until ready (#1999) 2025-04-18 19:13:18 -04:00
__init__.py export LibraryClient 2024-12-13 12:08:00 -08:00
env.py refactor(test): move tools, evals, datasetio, scoring and post training tests (#1401) 2025-03-04 14:53:47 -08:00
log.py chore: Remove style tags from log formatter (#1808) 2025-03-27 10:18:21 -04:00
schema_utils.py fix: dont check protocol compliance for experimental methods 2025-04-12 16:26:32 -07:00