Ashwin Bharambe
|
6fa074168e
|
update paths
|
2024-08-29 16:14:45 -07:00 |
|
Ashwin Bharambe
|
d12aa64bbf
|
Add termcolor
|
2024-08-29 16:04:00 -07:00 |
|
Ashwin Bharambe
|
3cb67f1f58
|
llama_toolchain/distribution -> llama_toolchain/core
|
2024-08-28 17:43:08 -07:00 |
|
Ashwin Bharambe
|
81540e6ce8
|
Update cli_reference.md
|
2024-08-28 17:36:32 -07:00 |
|
Ashwin Bharambe
|
896f057b76
|
Updated README phew
|
2024-08-28 17:34:23 -07:00 |
|
Ashwin Bharambe
|
3063329dad
|
Some quick fixes to the CLI behavior to make it consistent
|
2024-08-28 17:17:46 -07:00 |
|
Ashwin Bharambe
|
f1244f6d9e
|
Make Fireworks and Together into the Adapter format
|
2024-08-28 16:25:16 -07:00 |
|
Ashwin Bharambe
|
a23a6ab95b
|
Merge remote-tracking branch 'origin/main' into api_updates_1
|
2024-08-28 16:08:06 -07:00 |
|
Hassan El Mghari
|
f2e18826b6
|
Together AI basic integration (#43)
* working!
* accounting for eos
|
2024-08-28 16:07:13 -07:00 |
|
Ashwin Bharambe
|
d3965dd435
|
Merge remote-tracking branch 'origin/main' into api_updates_1
|
2024-08-28 16:02:34 -07:00 |
|
Ashwin Bharambe
|
197f768636
|
All the new CLI for api + stack work
|
2024-08-28 15:55:57 -07:00 |
|
Ashwin Bharambe
|
fd3b65b718
|
llama distribution -> llama stack + containers (WIP)
|
2024-08-28 15:55:21 -07:00 |
|
Ashwin Bharambe
|
45987996c4
|
Several smaller fixes to make adapters work
Also, reorganized the pattern of __init__ inside providers so
configuration can stay lightweight
|
2024-08-28 15:55:21 -07:00 |
|
Ashwin Bharambe
|
2a1552a5eb
|
ollama remote adapter works
|
2024-08-28 15:55:21 -07:00 |
|
Ashwin Bharambe
|
2076d2b6db
|
api build works for conda now
|
2024-08-28 15:55:21 -07:00 |
|
Ashwin Bharambe
|
c4fe72c3a3
|
bunch more work to make adapters work
|
2024-08-28 15:55:18 -07:00 |
|
Ashwin Bharambe
|
68f3db62e9
|
<WIP> adapters
|
2024-08-28 15:54:31 -07:00 |
|
Ashwin Bharambe
|
a4af9675ac
|
build + run image seems to work
|
2024-08-28 15:54:31 -07:00 |
|
Ashwin Bharambe
|
6f83187809
|
fix
|
2024-08-28 15:54:31 -07:00 |
|
Ashwin Bharambe
|
3a337c5f1c
|
Add api build subcommand -- WIP
|
2024-08-28 15:54:31 -07:00 |
|
Hardik Shah
|
f5620c09ad
|
Rag Updates
|
2024-08-27 20:09:33 -07:00 |
|
Ashwin Bharambe
|
a8b9541f19
|
Bump version to 0.0.10
|
2024-08-27 04:19:27 -07:00 |
|
raghotham
|
117b95b38c
|
Update RFC-0001-llama-stack.md
Added link to sequence diagram from agentic system
|
2024-08-26 20:56:09 -07:00 |
|
Hardik Shah
|
ea6d9ec937
|
templates take optional --format={json,function_tag}
|
2024-08-26 17:42:24 -07:00 |
|
Hardik Shah
|
69d9655ecd
|
Add ToolPromptFormat to ChatFormat.encode_message so that tools are encoded properly
|
2024-08-26 17:42:24 -07:00 |
|
Ashwin Bharambe
|
870cd7bb8b
|
Add blobfile for tiktoken
|
2024-08-26 14:50:53 -07:00 |
|
Ashwin Bharambe
|
decbbc127b
|
Add blobfile for tiktoken
|
2024-08-26 14:50:20 -07:00 |
|
Ashwin Bharambe
|
fd1c7f0197
|
Fix api.datatypes imports
|
2024-08-26 14:43:30 -07:00 |
|
Ashwin Bharambe
|
fb78bdc5a9
|
use interleaved_text_media_as_str() utilityt
|
2024-08-26 14:40:28 -07:00 |
|
Hardik Shah
|
e61b3d91ef
|
use a single impl for ChatFormat.decode_assistant_mesage
|
2024-08-26 14:27:32 -07:00 |
|
Hardik Shah
|
c3708859aa
|
minor import fixes
|
2024-08-26 14:27:32 -07:00 |
|
Ashwin Bharambe
|
dc433f6c90
|
split batch_inference from inference
|
2024-08-26 13:21:37 -07:00 |
|
Ashwin Bharambe
|
986a865e62
|
Attachment / add TTL api
|
2024-08-26 13:11:37 -07:00 |
|
Ashwin Bharambe
|
3230af4910
|
combine datatypes.py and endpoints.py into api.py
|
2024-08-26 12:58:04 -07:00 |
|
Ashwin Bharambe
|
c1078a60e7
|
remove api.endpoints imports
|
2024-08-26 12:58:04 -07:00 |
|
Hardik Shah
|
df489261ac
|
add special unicode character ↵ to showcase newlines in model prompt templates
|
2024-08-26 07:35:49 -07:00 |
|
Ashwin Bharambe
|
091eca0ba4
|
No need for api_key for Remote providers
|
2024-08-25 21:14:16 -07:00 |
|
Ashwin Bharambe
|
0760849a1f
|
Bug fix, show memory retrieval steps in EventLogger
|
2024-08-25 15:03:49 -07:00 |
|
Ashwin Bharambe
|
ceef117abc
|
Refactor custom tool execution utilities
|
2024-08-25 14:34:20 -07:00 |
|
Yufei (Benny) Chen
|
40ca8e21bd
|
Fireworks basic integration (#39)
|
2024-08-25 08:05:52 -07:00 |
|
Ashwin Bharambe
|
440d125ea0
|
small bug fixes for inline attachments
|
2024-08-24 23:51:27 -07:00 |
|
Ashwin Bharambe
|
58e2feceb0
|
basic RAG seems to work
|
2024-08-24 23:36:58 -07:00 |
|
Ashwin Bharambe
|
830252257b
|
fix agentic_system utils
|
2024-08-24 22:56:43 -07:00 |
|
Ashwin Bharambe
|
8efe614719
|
re-work tool definitions, fix FastAPI issues, fix tool regressions
|
2024-08-24 22:35:56 -07:00 |
|
Ashwin Bharambe
|
8d14d4228b
|
memory client works
|
2024-08-24 18:43:49 -07:00 |
|
Ashwin Bharambe
|
a08958c000
|
faiss provider implementation
|
2024-08-24 14:50:08 -07:00 |
|
Ashwin Bharambe
|
f812648aca
|
Bump version to 0.0.9
|
2024-08-24 09:45:01 -07:00 |
|
Ashwin Bharambe
|
14637bea66
|
agentic loop has a RAG implementation
|
2024-08-23 21:01:11 -07:00 |
|
Ashwin Bharambe
|
77d6055d9f
|
flesh out memory banks API
|
2024-08-23 21:01:08 -07:00 |
|
Ashwin Bharambe
|
31289e3f47
|
InterleavedTextAttachment -> InterleavedTextMedia, introduce memory tool
|
2024-08-23 21:00:19 -07:00 |
|