Commit graph

113 commits

Author SHA1 Message Date
Ashwin Bharambe
e53e115a5b Add a log just for consistency 2024-08-29 16:19:43 -07:00
Ashwin Bharambe
6fa074168e update paths 2024-08-29 16:14:45 -07:00
Ashwin Bharambe
d12aa64bbf Add termcolor 2024-08-29 16:04:00 -07:00
Ashwin Bharambe
3cb67f1f58 llama_toolchain/distribution -> llama_toolchain/core 2024-08-28 17:43:08 -07:00
Ashwin Bharambe
81540e6ce8 Update cli_reference.md 2024-08-28 17:36:32 -07:00
Ashwin Bharambe
896f057b76 Updated README phew 2024-08-28 17:34:23 -07:00
Ashwin Bharambe
3063329dad Some quick fixes to the CLI behavior to make it consistent 2024-08-28 17:17:46 -07:00
Ashwin Bharambe
f1244f6d9e Make Fireworks and Together into the Adapter format 2024-08-28 16:25:16 -07:00
Ashwin Bharambe
a23a6ab95b Merge remote-tracking branch 'origin/main' into api_updates_1 2024-08-28 16:08:06 -07:00
Hassan El Mghari
f2e18826b6 Together AI basic integration (#43)
* working!
* accounting for eos
2024-08-28 16:07:13 -07:00
Ashwin Bharambe
d3965dd435 Merge remote-tracking branch 'origin/main' into api_updates_1 2024-08-28 16:02:34 -07:00
Ashwin Bharambe
197f768636 All the new CLI for api + stack work 2024-08-28 15:55:57 -07:00
Ashwin Bharambe
fd3b65b718 llama distribution -> llama stack + containers (WIP) 2024-08-28 15:55:21 -07:00
Ashwin Bharambe
45987996c4 Several smaller fixes to make adapters work
Also, reorganized the pattern of __init__ inside providers so
configuration can stay lightweight
2024-08-28 15:55:21 -07:00
Ashwin Bharambe
2a1552a5eb ollama remote adapter works 2024-08-28 15:55:21 -07:00
Ashwin Bharambe
2076d2b6db api build works for conda now 2024-08-28 15:55:21 -07:00
Ashwin Bharambe
c4fe72c3a3 bunch more work to make adapters work 2024-08-28 15:55:18 -07:00
Ashwin Bharambe
68f3db62e9 <WIP> adapters 2024-08-28 15:54:31 -07:00
Ashwin Bharambe
a4af9675ac build + run image seems to work 2024-08-28 15:54:31 -07:00
Ashwin Bharambe
6f83187809 fix 2024-08-28 15:54:31 -07:00
Ashwin Bharambe
3a337c5f1c Add api build subcommand -- WIP 2024-08-28 15:54:31 -07:00
Hardik Shah
f5620c09ad Rag Updates 2024-08-27 20:09:33 -07:00
Ashwin Bharambe
a8b9541f19 Bump version to 0.0.10 2024-08-27 04:19:27 -07:00
raghotham
117b95b38c Update RFC-0001-llama-stack.md
Added link to sequence diagram from agentic system
2024-08-26 20:56:09 -07:00
Hardik Shah
ea6d9ec937 templates take optional --format={json,function_tag} 2024-08-26 17:42:24 -07:00
Hardik Shah
69d9655ecd Add ToolPromptFormat to ChatFormat.encode_message so that tools are encoded properly 2024-08-26 17:42:24 -07:00
Ashwin Bharambe
870cd7bb8b Add blobfile for tiktoken 2024-08-26 14:50:53 -07:00
Ashwin Bharambe
decbbc127b Add blobfile for tiktoken 2024-08-26 14:50:20 -07:00
Ashwin Bharambe
fd1c7f0197 Fix api.datatypes imports 2024-08-26 14:43:30 -07:00
Ashwin Bharambe
fb78bdc5a9 use interleaved_text_media_as_str() utility 2024-08-26 14:40:28 -07:00
Hardik Shah
e61b3d91ef use a single impl for ChatFormat.decode_assistant_mesage 2024-08-26 14:27:32 -07:00
Hardik Shah
c3708859aa minor import fixes 2024-08-26 14:27:32 -07:00
Ashwin Bharambe
dc433f6c90 split batch_inference from inference 2024-08-26 13:21:37 -07:00
Ashwin Bharambe
986a865e62 Attachment / add TTL api 2024-08-26 13:11:37 -07:00
Ashwin Bharambe
3230af4910 combine datatypes.py and endpoints.py into api.py 2024-08-26 12:58:04 -07:00
Ashwin Bharambe
c1078a60e7 remove api.endpoints imports 2024-08-26 12:58:04 -07:00
Hardik Shah
df489261ac add special unicode character ↵ to showcase newlines in model prompt templates 2024-08-26 07:35:49 -07:00
Ashwin Bharambe
091eca0ba4 No need for api_key for Remote providers 2024-08-25 21:14:16 -07:00
Ashwin Bharambe
0760849a1f Bug fix, show memory retrieval steps in EventLogger 2024-08-25 15:03:49 -07:00
Ashwin Bharambe
ceef117abc Refactor custom tool execution utilities 2024-08-25 14:34:20 -07:00
Yufei (Benny) Chen
40ca8e21bd Fireworks basic integration (#39) 2024-08-25 08:05:52 -07:00
Ashwin Bharambe
440d125ea0 small bug fixes for inline attachments 2024-08-24 23:51:27 -07:00
Ashwin Bharambe
58e2feceb0 basic RAG seems to work 2024-08-24 23:36:58 -07:00
Ashwin Bharambe
830252257b fix agentic_system utils 2024-08-24 22:56:43 -07:00
Ashwin Bharambe
8efe614719 re-work tool definitions, fix FastAPI issues, fix tool regressions 2024-08-24 22:35:56 -07:00
Ashwin Bharambe
8d14d4228b memory client works 2024-08-24 18:43:49 -07:00
Ashwin Bharambe
a08958c000 faiss provider implementation 2024-08-24 14:50:08 -07:00
Ashwin Bharambe
f812648aca Bump version to 0.0.9 2024-08-24 09:45:01 -07:00
Ashwin Bharambe
14637bea66 agentic loop has a RAG implementation 2024-08-23 21:01:11 -07:00
Ashwin Bharambe
77d6055d9f flesh out memory banks API 2024-08-23 21:01:08 -07:00