Ashwin Bharambe
d12aa64bbf
Add termcolor
2024-08-29 16:04:00 -07:00
Ashwin Bharambe
3cb67f1f58
llama_toolchain/distribution -> llama_toolchain/core
2024-08-28 17:43:08 -07:00
Ashwin Bharambe
81540e6ce8
Update cli_reference.md
2024-08-28 17:36:32 -07:00
Ashwin Bharambe
896f057b76
Updated README phew
2024-08-28 17:34:23 -07:00
Ashwin Bharambe
3063329dad
Some quick fixes to the CLI behavior to make it consistent
2024-08-28 17:17:46 -07:00
Ashwin Bharambe
f1244f6d9e
Make Fireworks and Together into the Adapter format
2024-08-28 16:25:16 -07:00
Ashwin Bharambe
a23a6ab95b
Merge remote-tracking branch 'origin/main' into api_updates_1
2024-08-28 16:08:06 -07:00
Hassan El Mghari
f2e18826b6
Together AI basic integration ( #43 )
...
* working!
* accounting for eos
2024-08-28 16:07:13 -07:00
Ashwin Bharambe
d3965dd435
Merge remote-tracking branch 'origin/main' into api_updates_1
2024-08-28 16:02:34 -07:00
Ashwin Bharambe
197f768636
All the new CLI for api + stack work
2024-08-28 15:55:57 -07:00
Ashwin Bharambe
fd3b65b718
llama distribution -> llama stack + containers (WIP)
2024-08-28 15:55:21 -07:00
Ashwin Bharambe
45987996c4
Several smaller fixes to make adapters work
...
Also, reorganized the pattern of __init__ inside providers so
configuration can stay lightweight
2024-08-28 15:55:21 -07:00
Ashwin Bharambe
2a1552a5eb
ollama remote adapter works
2024-08-28 15:55:21 -07:00
Ashwin Bharambe
2076d2b6db
api build works for conda now
2024-08-28 15:55:21 -07:00
Ashwin Bharambe
c4fe72c3a3
bunch more work to make adapters work
2024-08-28 15:55:18 -07:00
Ashwin Bharambe
68f3db62e9
<WIP> adapters
2024-08-28 15:54:31 -07:00
Ashwin Bharambe
a4af9675ac
build + run image seems to work
2024-08-28 15:54:31 -07:00
Ashwin Bharambe
6f83187809
fix
2024-08-28 15:54:31 -07:00
Ashwin Bharambe
3a337c5f1c
Add api build
subcommand -- WIP
2024-08-28 15:54:31 -07:00
Hardik Shah
f5620c09ad
Rag Updates
2024-08-27 20:09:33 -07:00
Ashwin Bharambe
a8b9541f19
Bump version to 0.0.10
2024-08-27 04:19:27 -07:00
raghotham
117b95b38c
Update RFC-0001-llama-stack.md
...
Added link to sequence diagram from agentic system
2024-08-26 20:56:09 -07:00
Hardik Shah
ea6d9ec937
templates take optional --format={json,function_tag}
2024-08-26 17:42:24 -07:00
Hardik Shah
69d9655ecd
Add ToolPromptFormat to ChatFormat.encode_message so that tools are encoded properly
2024-08-26 17:42:24 -07:00
Ashwin Bharambe
870cd7bb8b
Add blobfile for tiktoken
2024-08-26 14:50:53 -07:00
Ashwin Bharambe
decbbc127b
Add blobfile for tiktoken
2024-08-26 14:50:20 -07:00
Ashwin Bharambe
fd1c7f0197
Fix api.datatypes imports
2024-08-26 14:43:30 -07:00
Ashwin Bharambe
fb78bdc5a9
use interleaved_text_media_as_str() utilityt
2024-08-26 14:40:28 -07:00
Hardik Shah
e61b3d91ef
use a single impl for ChatFormat.decode_assistant_mesage
2024-08-26 14:27:32 -07:00
Hardik Shah
c3708859aa
minor import fixes
2024-08-26 14:27:32 -07:00
Ashwin Bharambe
dc433f6c90
split batch_inference from inference
2024-08-26 13:21:37 -07:00
Ashwin Bharambe
986a865e62
Attachment / add TTL api
2024-08-26 13:11:37 -07:00
Ashwin Bharambe
3230af4910
combine datatypes.py and endpoints.py into api.py
2024-08-26 12:58:04 -07:00
Ashwin Bharambe
c1078a60e7
remove api.endpoints imports
2024-08-26 12:58:04 -07:00
Hardik Shah
df489261ac
add special unicode character ↵ to showcase newlines in model prompt templates
2024-08-26 07:35:49 -07:00
Ashwin Bharambe
091eca0ba4
No need for api_key for Remote providers
2024-08-25 21:14:16 -07:00
Ashwin Bharambe
0760849a1f
Bug fix, show memory retrieval steps in EventLogger
2024-08-25 15:03:49 -07:00
Ashwin Bharambe
ceef117abc
Refactor custom tool execution utilities
2024-08-25 14:34:20 -07:00
Yufei (Benny) Chen
40ca8e21bd
Fireworks basic integration ( #39 )
2024-08-25 08:05:52 -07:00
Ashwin Bharambe
440d125ea0
small bug fixes for inline attachments
2024-08-24 23:51:27 -07:00
Ashwin Bharambe
58e2feceb0
basic RAG seems to work
2024-08-24 23:36:58 -07:00
Ashwin Bharambe
830252257b
fix agentic_system utils
2024-08-24 22:56:43 -07:00
Ashwin Bharambe
8efe614719
re-work tool definitions, fix FastAPI issues, fix tool regressions
2024-08-24 22:35:56 -07:00
Ashwin Bharambe
8d14d4228b
memory client works
2024-08-24 18:43:49 -07:00
Ashwin Bharambe
a08958c000
faiss provider implementation
2024-08-24 14:50:08 -07:00
Ashwin Bharambe
f812648aca
Bump version to 0.0.9
2024-08-24 09:45:01 -07:00
Ashwin Bharambe
14637bea66
agentic loop has a RAG implementation
2024-08-23 21:01:11 -07:00
Ashwin Bharambe
77d6055d9f
flesh out memory banks API
2024-08-23 21:01:08 -07:00
Ashwin Bharambe
31289e3f47
InterleavedTextAttachment -> InterleavedTextMedia, introduce memory tool
2024-08-23 21:00:19 -07:00
Ashwin Bharambe
48c6a32edd
<WIP> memory changes
...
- inlined AgenticSystemInstanceConfig so API feels more ergonomic
- renamed it to AgentConfig, AgentInstance -> Agent
- added a MemoryConfig and `memory` parameter
- added `attachments` to input and `output_attachments` to the response
- some naming changes
2024-08-23 21:00:17 -07:00