Ashwin Bharambe
|
fb3c4566ce
|
llama stack start -> llama stack run
|
2024-09-03 11:23:26 -07:00 |
|
Ashwin Bharambe
|
fab6bd1728
|
Update documentation again and add error messages to llama stack start
|
2024-09-02 21:36:32 -07:00 |
|
Ashwin Bharambe
|
279565499b
|
Fixes to llama stack commands and update docs
|
2024-09-02 18:58:54 -07:00 |
|
Ashwin Bharambe
|
5927f3c3c0
|
Remote llama api [] subcommands
|
2024-09-02 18:48:19 -07:00 |
|
Ashwin Bharambe
|
9be0edc76c
|
Allow building an "adhoc" distribution
|
2024-09-02 18:37:31 -07:00 |
|
Ashwin Bharambe
|
d99c06fce8
|
Fix stack start
|
2024-08-30 15:03:23 -07:00 |
|
Ashwin Bharambe
|
5172d9a79d
|
Update llama stack configure to be very simple also
|
2024-08-30 14:55:20 -07:00 |
|
Ashwin Bharambe
|
f8517e4688
|
Simplify and generalize llama api build yay
|
2024-08-30 14:51:40 -07:00 |
|
Ashwin Bharambe
|
297d51b183
|
Support downloading of URLs for attachments for code interpreter
|
2024-08-30 12:10:48 -07:00 |
|
Ashwin Bharambe
|
afb18880b5
|
Delete utils.py; move to agentic system
|
2024-08-30 11:53:21 -07:00 |
|
Ashwin Bharambe
|
9ec06918a5
|
missing import lol
|
2024-08-30 10:45:23 -07:00 |
|
Ashwin Bharambe
|
a2470aae11
|
Fix api dependencies not getting added to configuration
|
2024-08-30 10:40:17 -07:00 |
|
Dalton Flanagan
|
886a01ee2e
|
chmod +x scripts
|
2024-08-30 00:07:12 -04:00 |
|
Ashwin Bharambe
|
e53e115a5b
|
Add a log just for consistency
|
2024-08-29 16:19:43 -07:00 |
|
Ashwin Bharambe
|
6fa074168e
|
update paths
|
2024-08-29 16:14:45 -07:00 |
|
Ashwin Bharambe
|
d12aa64bbf
|
Add termcolor
|
2024-08-29 16:04:00 -07:00 |
|
Ashwin Bharambe
|
3cb67f1f58
|
llama_toolchain/distribution -> llama_toolchain/core
|
2024-08-28 17:43:08 -07:00 |
|
Ashwin Bharambe
|
81540e6ce8
|
Update cli_reference.md
|
2024-08-28 17:36:32 -07:00 |
|
Ashwin Bharambe
|
896f057b76
|
Updated README phew
|
2024-08-28 17:34:23 -07:00 |
|
Ashwin Bharambe
|
3063329dad
|
Some quick fixes to the CLI behavior to make it consistent
|
2024-08-28 17:17:46 -07:00 |
|
Ashwin Bharambe
|
f1244f6d9e
|
Make Fireworks and Together into the Adapter format
|
2024-08-28 16:25:16 -07:00 |
|
Ashwin Bharambe
|
a23a6ab95b
|
Merge remote-tracking branch 'origin/main' into api_updates_1
|
2024-08-28 16:08:06 -07:00 |
|
Hassan El Mghari
|
f2e18826b6
|
Together AI basic integration (#43)
* working!
* accounting for eos
|
2024-08-28 16:07:13 -07:00 |
|
Ashwin Bharambe
|
d3965dd435
|
Merge remote-tracking branch 'origin/main' into api_updates_1
|
2024-08-28 16:02:34 -07:00 |
|
Ashwin Bharambe
|
197f768636
|
All the new CLI for api + stack work
|
2024-08-28 15:55:57 -07:00 |
|
Ashwin Bharambe
|
fd3b65b718
|
llama distribution -> llama stack + containers (WIP)
|
2024-08-28 15:55:21 -07:00 |
|
Ashwin Bharambe
|
45987996c4
|
Several smaller fixes to make adapters work
Also, reorganized the pattern of __init__ inside providers so
configuration can stay lightweight
|
2024-08-28 15:55:21 -07:00 |
|
Ashwin Bharambe
|
2a1552a5eb
|
ollama remote adapter works
|
2024-08-28 15:55:21 -07:00 |
|
Ashwin Bharambe
|
2076d2b6db
|
api build works for conda now
|
2024-08-28 15:55:21 -07:00 |
|
Ashwin Bharambe
|
c4fe72c3a3
|
bunch more work to make adapters work
|
2024-08-28 15:55:18 -07:00 |
|
Ashwin Bharambe
|
68f3db62e9
|
<WIP> adapters
|
2024-08-28 15:54:31 -07:00 |
|
Ashwin Bharambe
|
a4af9675ac
|
build + run image seems to work
|
2024-08-28 15:54:31 -07:00 |
|
Ashwin Bharambe
|
6f83187809
|
fix
|
2024-08-28 15:54:31 -07:00 |
|
Ashwin Bharambe
|
3a337c5f1c
|
Add api build subcommand -- WIP
|
2024-08-28 15:54:31 -07:00 |
|
Hardik Shah
|
f5620c09ad
|
Rag Updates
|
2024-08-27 20:09:33 -07:00 |
|
Ashwin Bharambe
|
a8b9541f19
|
Bump version to 0.0.10
|
2024-08-27 04:19:27 -07:00 |
|
raghotham
|
117b95b38c
|
Update RFC-0001-llama-stack.md
Added link to sequence diagram from agentic system
|
2024-08-26 20:56:09 -07:00 |
|
Hardik Shah
|
ea6d9ec937
|
templates take optional --format={json,function_tag}
|
2024-08-26 17:42:24 -07:00 |
|
Hardik Shah
|
69d9655ecd
|
Add ToolPromptFormat to ChatFormat.encode_message so that tools are encoded properly
|
2024-08-26 17:42:24 -07:00 |
|
Ashwin Bharambe
|
870cd7bb8b
|
Add blobfile for tiktoken
|
2024-08-26 14:50:53 -07:00 |
|
Ashwin Bharambe
|
decbbc127b
|
Add blobfile for tiktoken
|
2024-08-26 14:50:20 -07:00 |
|
Ashwin Bharambe
|
fd1c7f0197
|
Fix api.datatypes imports
|
2024-08-26 14:43:30 -07:00 |
|
Ashwin Bharambe
|
fb78bdc5a9
|
use interleaved_text_media_as_str() utility
|
2024-08-26 14:40:28 -07:00 |
|
Hardik Shah
|
e61b3d91ef
|
use a single impl for ChatFormat.decode_assistant_mesage
|
2024-08-26 14:27:32 -07:00 |
|
Hardik Shah
|
c3708859aa
|
minor import fixes
|
2024-08-26 14:27:32 -07:00 |
|
Ashwin Bharambe
|
dc433f6c90
|
split batch_inference from inference
|
2024-08-26 13:21:37 -07:00 |
|
Ashwin Bharambe
|
986a865e62
|
Attachment / add TTL api
|
2024-08-26 13:11:37 -07:00 |
|
Ashwin Bharambe
|
3230af4910
|
combine datatypes.py and endpoints.py into api.py
|
2024-08-26 12:58:04 -07:00 |
|
Ashwin Bharambe
|
c1078a60e7
|
remove api.endpoints imports
|
2024-08-26 12:58:04 -07:00 |
|
Hardik Shah
|
df489261ac
|
add special unicode character ↵ to showcase newlines in model prompt templates
|
2024-08-26 07:35:49 -07:00 |
|