[inference] Add a TGI adapter (#52)

* TGI adapter and some refactoring of other inference adapters * Use the lower-level `generate_stream()` method for correct tool calling --------- Co-authored-by: Ashwin Bharambe <ashwin@meta.com>
2025-06-28 02:53:30 +00:00 · 2024-09-04 22:49:33 -07:00 · 2024-09-04 22:49:33 -07:00 · 21bedc1596
commit 21bedc1596
parent 6ad7365676
3 changed files with 256 additions and 0 deletions
--- a/llama_toolchain/inference/providers.py
+++ b/llama_toolchain/inference/providers.py
@ -35,6 +35,14 @@ def available_inference_providers() -> List[ProviderSpec]:
                module="llama_toolchain.inference.adapters.ollama",
            ),
        ),
+        remote_provider_spec(
+            api=Api.inference,
+            adapter=AdapterSpec(
+                adapter_id="tgi",
+                pip_packages=["text-generation"],
+                module="llama_toolchain.inference.adapters.tgi",
+            ),
+        ),
        remote_provider_spec(
            api=Api.inference,
            adapter=AdapterSpec(