Hardik Shah
|
7574ffb25f
|
added __init__
|
2024-07-22 14:49:26 -07:00 |
|
Hardik Shah
|
441e5da6ed
|
no special casign for original
|
2024-07-22 14:42:38 -07:00 |
|
Hardik Shah
|
4d3b226275
|
check original folder
|
2024-07-22 14:35:09 -07:00 |
|
Hardik Shah
|
74442e88b1
|
add yaml to manifest
|
2024-07-22 13:34:08 -07:00 |
|
Hardik Shah
|
6f0d348b1c
|
add init for common
|
2024-07-22 11:51:10 -07:00 |
|
Ashwin Bharambe
|
54a22e288a
|
requirements
|
2024-07-22 11:39:42 -07:00 |
|
Ashwin Bharambe
|
c38d638340
|
sku -> family
|
2024-07-22 11:15:04 -07:00 |
|
Ashwin Bharambe
|
f0e0903270
|
add llama model subcommand
|
2024-07-22 11:07:11 -07:00 |
|
Hardik Shah
|
4417407652
|
agentic_system --> llama_agentic_system
|
2024-07-22 01:20:32 -07:00 |
|
Hardik Shah
|
1eac470045
|
add __init__
|
2024-07-22 01:17:54 -07:00 |
|
Ashwin Bharambe
|
2e7978fa39
|
update import for quantization format from models
|
2024-07-22 00:04:03 -07:00 |
|
Hardik Shah
|
f9111652ef
|
rename toolchain/ --> llama_toolchain/
|
2024-07-21 23:48:38 -07:00 |
|
Hardik Shah
|
d95f5f863d
|
use default_config file to configure inference
|
2024-07-21 19:26:11 -07:00 |
|
Hardik Shah
|
c64b8cba22
|
from models.llama3_1 --> from llama_models.llama3_1
|
2024-07-21 19:07:02 -07:00 |
|
Hardik Shah
|
c6ef16f6bd
|
consol_scripts for toolchain
|
2024-07-21 17:39:47 -07:00 |
|
rsm
|
7c69675b79
|
added pypi package
|
2024-07-21 13:43:36 -07:00 |
|
Hardik Shah
|
b0f3406a08
|
deleting bash script as this is not done via cli
|
2024-07-21 12:55:49 -07:00 |
|
Hardik Shah
|
6bcd826b32
|
enable import of subcommands from llama-agentic-system
|
2024-07-21 12:54:48 -07:00 |
|
rsm
|
67f0510edd
|
rename ModelInference to Inference
|
2024-07-21 12:20:32 -07:00 |
|
Ashwin Bharambe
|
245461620d
|
make sure scripts always have pipefail
|
2024-07-21 12:18:49 -07:00 |
|
Hardik Shah
|
c9f33d8f68
|
cli updates
|
2024-07-21 01:51:54 -07:00 |
|
Hardik Shah
|
23fe353e4a
|
cli -- llama inference configure
|
2024-07-21 01:17:15 -07:00 |
|
Ashwin Bharambe
|
0df57c4447
|
fix bad merge with injection shield?
|
2024-07-20 23:54:44 -07:00 |
|
Hardik Shah
|
2408bd81c8
|
easy script to create config
|
2024-07-20 23:51:46 -07:00 |
|
Ashwin Bharambe
|
7c9ed3e58e
|
update README a bit
|
2024-07-20 23:26:50 -07:00 |
|
Ashwin Bharambe
|
d73fed5cc3
|
cleanup for fp8 and requirements etc
|
2024-07-20 23:21:55 -07:00 |
|
Hardik Shah
|
2428701951
|
download inside model_name directory
|
2024-07-20 23:16:19 -07:00 |
|
Ashwin Bharambe
|
0746a0f62b
|
fp8 inference
|
2024-07-20 23:13:47 -07:00 |
|
Ashwin Bharambe
|
ad62e2e1f3
|
make inference server load checkpoints for fp8 inference
- introduce quantization related args for inference config
- also kill GeneratorArgs
|
2024-07-20 22:54:48 -07:00 |
|
Ashwin Bharambe
|
7d2c0b14b8
|
Changes from the main repo
|
2024-07-20 22:52:29 -07:00 |
|
Hardik Shah
|
9c9b834c0f
|
update prompt-shield to reflect latest changes in agentic
|
2024-07-19 18:12:09 -07:00 |
|
Hardik Shah
|
ce0804556b
|
update requirements for running standalone
|
2024-07-19 18:11:25 -07:00 |
|
Hardik Shah
|
2ed2881a21
|
fixed imports models.llama3. --> models.llama3_1.api.
|
2024-07-19 17:42:14 -07:00 |
|
Ashwin Bharambe
|
f94efcf2ee
|
kill older junk
|
2024-07-19 12:32:22 -07:00 |
|
Ashwin Bharambe
|
95781ec85d
|
Add toolchain from agentic system here
|
2024-07-19 12:30:35 -07:00 |
|
Ashwin Bharambe
|
f6b2b2fb39
|
cleanup
|
2024-07-11 10:04:56 -07:00 |
|
Raghotham Murthy
|
6d6c07b882
|
added more docs
|
2024-07-11 03:12:28 -07:00 |
|
Raghotham Murthy
|
8631d90f1e
|
added more docs
|
2024-07-11 03:11:45 -07:00 |
|
Raghotham Murthy
|
e657e71446
|
added more docs
|
2024-07-11 03:10:30 -07:00 |
|
Raghotham Murthy
|
ab44e9c862
|
added more docs
|
2024-07-11 03:09:45 -07:00 |
|
raghotham
|
62f2db8f62
|
saving the spec changes
|
2024-07-11 05:02:16 -04:00 |
|
Raghotham Murthy
|
0e4b9efedf
|
added more docs
|
2024-07-11 01:54:03 -07:00 |
|
Raghotham Murthy
|
9070d45629
|
added more docs
|
2024-07-11 01:48:13 -07:00 |
|
Raghotham Murthy
|
65556d0a1c
|
added more docs
|
2024-07-11 01:46:33 -07:00 |
|
Raghotham Murthy
|
b88f8ad616
|
added more docs
|
2024-07-11 01:38:04 -07:00 |
|
Raghotham Murthy
|
6778359493
|
added more docs
|
2024-07-11 01:36:02 -07:00 |
|
Raghotham Murthy
|
dad480ded7
|
added more docs
|
2024-07-11 01:33:24 -07:00 |
|
Raghotham Murthy
|
0eabaffc3f
|
added more docs
|
2024-07-11 01:32:24 -07:00 |
|
Raghotham Murthy
|
f431c18efc
|
added more docs
|
2024-07-11 01:32:17 -07:00 |
|
Raghotham Murthy
|
5704847d74
|
added more docs
|
2024-07-11 01:26:42 -07:00 |
|