Hardik Shah
156bfa0e15
Added Ollama as an inference impl ( #20 )
...
* fix non-streaming api in inference server
* unit test for inline inference
* Added non-streaming ollama inference impl
* add streaming support for ollama inference with tests
* addressing comments
---------
Co-authored-by: Hardik Shah <hjshah@fb.com>
2024-07-31 22:08:37 -07:00
Ashwin Bharambe
c253c1c9ad
Begin adding a /safety/run_shield API
2024-07-31 21:57:10 -07:00
Ashwin Bharambe
1bc81eae7b
update toolchain to work with updated imports from llama_models
2024-07-30 17:52:57 -07:00
Ashwin Bharambe
23014ea4d1
Add hacks because Cloudfront config limits on the 405b model files
2024-07-30 13:46:47 -07:00
Ashwin Bharambe
404af06e02
Bump version to 0.0.2
2024-07-29 23:56:41 -07:00
Ashwin Bharambe
7306e6b167
show sampling params in model describe
2024-07-29 23:44:07 -07:00
Ashwin Bharambe
040c30ee54
added resumable downloader for downloading models
2024-07-29 23:29:16 -07:00
Ashwin Bharambe
59574924de
model template --template -> model template --name
2024-07-29 18:21:05 -07:00
Ashwin Bharambe
45b8a7ffcd
Add model describe subcommand
2024-07-29 18:19:53 -07:00
Ashwin Bharambe
9d7f283722
Add model list subcommand
2024-07-29 16:39:53 -07:00
Dalton Flanagan
a789c47ec9
Update cli_reference.md
2024-07-29 16:31:56 -04:00
Dalton Flanagan
dd6c1f1e64
Add links to shields
2024-07-27 11:28:46 -04:00
Dalton Flanagan
b5d7cec11e
Add shields to README
2024-07-27 11:02:50 -04:00
Ashwin Bharambe
3583cf2d51
update model template output to be prettier, more consumable
2024-07-26 15:39:46 -07:00
Ashwin Bharambe
51f8049c7a
Update fp8_requirements, we don't need nightly torch anymore
2024-07-26 08:25:44 -07:00
Dalton Flanagan
ec433448f2
Add CLI reference docs ( #14 )
...
* Add CLI reference doc
* touchups
* add helptext for download
2024-07-25 13:56:29 -07:00
Jianyu Huang
b8aa99b034
Update fbgemm version ( #12 )
2024-07-24 23:48:44 -07:00
Lucain
378a2077dd
Update download command ( #9 )
2024-07-24 16:50:40 -07:00
Ashwin Bharambe
17bd1d876c
Canonical package name for the dependency
2024-07-23 13:30:33 -07:00
Ashwin Bharambe
f7e053e3ba
Updates to setup and requirements for PyPI
2024-07-23 13:28:30 -07:00
Ashwin Bharambe
d802d0f051
add requirements to MANIFEST.in
2024-07-23 12:59:28 -07:00
Ashwin Bharambe
5d5acc8ed5
Initial commit
2024-07-23 08:32:33 -07:00