Ashwin Bharambe
f27d629fe8
Reduce a bunch of dependencies from toolchain
Some improvements to the distribution install script
2024-08-07 21:55:07 -07:00
Hardik Shah
171a178783
get ollama working
2024-08-07 17:53:15 -07:00
Ashwin Bharambe
66412b932b
Nuke fp8_requirements, fold fbgemm into common requirements
2024-08-07 13:58:55 -07:00
Hardik Shah
0a67f3d3e6
installation fixes
2024-08-05 18:04:44 -07:00
Ashwin Bharambe
9dafa6ad94
implement full-passthrough in the server
2024-08-03 14:15:20 -07:00
Hardik Shah
156bfa0e15
Added Ollama as an inference impl (#20)
* fix non-streaming api in inference server
* unit test for inline inference
* Added non-streaming ollama inference impl
* add streaming support for ollama inference with tests
* addressing comments
---------
Co-authored-by: Hardik Shah <hjshah@fb.com>
2024-07-31 22:08:37 -07:00
Ashwin Bharambe
17bd1d876c
Canonical package name for the dependency
2024-07-23 13:30:33 -07:00
Ashwin Bharambe
f7e053e3ba
Updates to setup and requirements for PyPI
2024-07-23 13:28:30 -07:00
Ashwin Bharambe
5d5acc8ed5
Initial commit
2024-07-23 08:32:33 -07:00