Ashwin Bharambe
a0e61a3c7a
Fix passthrough streaming, send headers properly not part of body :facepalm
2024-08-06 16:39:38 -07:00
Ashwin Bharambe
be19b22391
Bring agentic system api to toolchain
Add adapter dependencies and resolve adapters using a topological sort
2024-08-04 17:33:29 -07:00
Ashwin Bharambe
9dafa6ad94
implement full-passthrough in the server
2024-08-03 14:15:20 -07:00
Ashwin Bharambe
af4710c959
Improved exception handling
2024-08-02 15:52:15 -07:00
Hardik Shah
493f0d99b2
updated dependency and client model name
2024-08-02 15:37:40 -07:00
Ashwin Bharambe
09cf3fe78b
Use new definitions of Model / SKU
2024-07-31 22:44:35 -07:00
Hardik Shah
156bfa0e15
Added Ollama as an inference impl (#20)
* fix non-streaming api in inference server
* unit test for inline inference
* Added non-streaming ollama inference impl
* add streaming support for ollama inference with tests
* addressing comments
---------
Co-authored-by: Hardik Shah <hjshah@fb.com>
2024-07-31 22:08:37 -07:00
Ashwin Bharambe
5d5acc8ed5
Initial commit
2024-07-23 08:32:33 -07:00