Commit graph

85 commits

Author SHA1 Message Date
Hardik Shah
6f0d348b1c add init for common 2024-07-22 11:51:10 -07:00
Ashwin Bharambe
54a22e288a requirements 2024-07-22 11:39:42 -07:00
Ashwin Bharambe
c38d638340 sku -> family 2024-07-22 11:15:04 -07:00
Ashwin Bharambe
f0e0903270 add llama model subcommand 2024-07-22 11:07:11 -07:00
Hardik Shah
4417407652 agentic_system --> llama_agentic_system 2024-07-22 01:20:32 -07:00
Hardik Shah
1eac470045 add __init__ 2024-07-22 01:17:54 -07:00
Ashwin Bharambe
2e7978fa39 update import for quantization format from models 2024-07-22 00:04:03 -07:00
Hardik Shah
f9111652ef rename toolchain/ --> llama_toolchain/ 2024-07-21 23:48:38 -07:00
Hardik Shah
d95f5f863d use default_config file to configure inference 2024-07-21 19:26:11 -07:00
Hardik Shah
c64b8cba22 from models.llama3_1 --> from llama_models.llama3_1 2024-07-21 19:07:02 -07:00
Hardik Shah
c6ef16f6bd consol_scripts for toolchain 2024-07-21 17:39:47 -07:00
rsm
7c69675b79 added pypi package 2024-07-21 13:43:36 -07:00
Hardik Shah
b0f3406a08 deleting bash script as this is not done via cli 2024-07-21 12:55:49 -07:00
Hardik Shah
6bcd826b32 enable import of subcommands from llama-agentic-system 2024-07-21 12:54:48 -07:00
rsm
67f0510edd rename ModelInference to Inference 2024-07-21 12:20:32 -07:00
Ashwin Bharambe
245461620d make sure scripts always have pipefail 2024-07-21 12:18:49 -07:00
Hardik Shah
c9f33d8f68 cli updates 2024-07-21 01:51:54 -07:00
Hardik Shah
23fe353e4a cli -- llama inference configure 2024-07-21 01:17:15 -07:00
Ashwin Bharambe
0df57c4447 fix bad merge with injection shield? 2024-07-20 23:54:44 -07:00
Hardik Shah
2408bd81c8 easy script to create config 2024-07-20 23:51:46 -07:00
Ashwin Bharambe
7c9ed3e58e update README a bit 2024-07-20 23:26:50 -07:00
Ashwin Bharambe
d73fed5cc3 cleanup for fp8 and requirements etc 2024-07-20 23:21:55 -07:00
Hardik Shah
2428701951 download inside model_name directory 2024-07-20 23:16:19 -07:00
Ashwin Bharambe
0746a0f62b fp8 inference 2024-07-20 23:13:47 -07:00
Ashwin Bharambe
ad62e2e1f3 make inference server load checkpoints for fp8 inference
- introduce quantization related args for inference config
- also kill GeneratorArgs
2024-07-20 22:54:48 -07:00
Ashwin Bharambe
7d2c0b14b8 Changes from the main repo 2024-07-20 22:52:29 -07:00
Hardik Shah
9c9b834c0f update prompt-shield to reflect latest changes in agentic 2024-07-19 18:12:09 -07:00
Hardik Shah
ce0804556b update requirements for running standalone 2024-07-19 18:11:25 -07:00
Hardik Shah
2ed2881a21 fixed imports models.llama3. --> models.llama3_1.api. 2024-07-19 17:42:14 -07:00
Ashwin Bharambe
f94efcf2ee kill older junk 2024-07-19 12:32:22 -07:00
Ashwin Bharambe
95781ec85d Add toolchain from agentic system here 2024-07-19 12:30:35 -07:00
Ashwin Bharambe
f6b2b2fb39 cleanup 2024-07-11 10:04:56 -07:00
Raghotham Murthy
6d6c07b882 added more docs 2024-07-11 03:12:28 -07:00
Raghotham Murthy
8631d90f1e added more docs 2024-07-11 03:11:45 -07:00
Raghotham Murthy
e657e71446 added more docs 2024-07-11 03:10:30 -07:00
Raghotham Murthy
ab44e9c862 added more docs 2024-07-11 03:09:45 -07:00
raghotham
62f2db8f62
saving the spec changes 2024-07-11 05:02:16 -04:00
Raghotham Murthy
0e4b9efedf added more docs 2024-07-11 01:54:03 -07:00
Raghotham Murthy
9070d45629 added more docs 2024-07-11 01:48:13 -07:00
Raghotham Murthy
65556d0a1c added more docs 2024-07-11 01:46:33 -07:00
Raghotham Murthy
b88f8ad616 added more docs 2024-07-11 01:38:04 -07:00
Raghotham Murthy
6778359493 added more docs 2024-07-11 01:36:02 -07:00
Raghotham Murthy
dad480ded7 added more docs 2024-07-11 01:33:24 -07:00
Raghotham Murthy
0eabaffc3f added more docs 2024-07-11 01:32:24 -07:00
Raghotham Murthy
f431c18efc added more docs 2024-07-11 01:32:17 -07:00
Raghotham Murthy
5704847d74 added more docs 2024-07-11 01:26:42 -07:00
Raghotham Murthy
bf18ba3940 added more docs 2024-07-11 01:26:34 -07:00
Raghotham Murthy
067ec4ce50 added memory_bank endpoint 2024-07-11 00:14:28 -07:00
Ashwin Bharambe
86c2993296 small rename 2024-07-11 00:07:19 -07:00
Ashwin Bharambe
631328f556 added DPO 2024-07-11 00:01:58 -07:00