Commit graph

8 commits

Author SHA1 Message Date
Hardik Shah
171a178783 get ollama working 2024-08-07 17:53:15 -07:00
Ashwin Bharambe
66412b932b Nuke fp8_requirements, fold fbgemm into common requirements 2024-08-07 13:58:55 -07:00
Hardik Shah
0a67f3d3e6 installation fixes 2024-08-05 18:04:44 -07:00
Ashwin Bharambe
9dafa6ad94 implement full-passthrough in the server 2024-08-03 14:15:20 -07:00
Hardik Shah
156bfa0e15
Added Ollama as an inference impl (#20)
* fix non-streaming api in inference server

* unit test for inline inference

* Added non-streaming ollama inference impl

* add streaming support for ollama inference with tests

* addressing comments

---------

Co-authored-by: Hardik Shah <hjshah@fb.com>
2024-07-31 22:08:37 -07:00
Ashwin Bharambe
17bd1d876c Canonical package name for the dependency 2024-07-23 13:30:33 -07:00
Ashwin Bharambe
f7e053e3ba Updates to setup and requirements for PyPI 2024-07-23 13:28:30 -07:00
Ashwin Bharambe
5d5acc8ed5 Initial commit 2024-07-23 08:32:33 -07:00