Xi Yan
c4534217c8
fix cli describe
2024-09-24 14:41:19 -07:00
Ashwin Bharambe
00352bd251
Respect passed in embedding model
2024-09-24 14:40:28 -07:00
Ashwin Bharambe
bda974e660
Make the "all-remote" distribution lightweight in dependencies and size
2024-09-24 14:18:57 -07:00
Ashwin Bharambe
445536de64
Add httpx to core server deps
2024-09-24 10:42:04 -07:00
Ashwin Bharambe
7b35a4c827
Bump version to 0.0.24
2024-09-24 10:15:20 -07:00
Ashwin Bharambe
8d511cdf91
Make build_conda_env a bit more robust
2024-09-24 10:12:07 -07:00
Ashwin Bharambe
cd850c16de
Bump version to 0.0.23
2024-09-24 09:08:40 -07:00
Xi Yan
d04cd97aba
remove providers/impls/sqlite/*
2024-09-24 01:03:40 -07:00
Ashwin Bharambe
e617273d8c
attribute changed (model_args -> arch_args)
2024-09-23 21:44:26 -07:00
Ashwin Bharambe
f136f802b1
Somewhat better error handling
2024-09-23 21:40:14 -07:00
Xi Yan
f92ff86b96
fix shields in agents safety
2024-09-23 21:22:22 -07:00
Ashwin Bharambe
c9005e95ed
Another attempt at a proper bugfix for safety violations
2024-09-23 19:06:30 -07:00
Xi Yan
e5bdd6615a
bug fix for safety violation
2024-09-23 18:17:15 -07:00
Xi Yan
70fb70a71c
fix URL issue with agents
2024-09-23 16:44:25 -07:00
Ashwin Bharambe
9eb5ec3e4b
Bump version to 0.0.21
2024-09-23 14:23:21 -07:00
Ashwin Bharambe
ec4fc800cc
[API Updates] Model / shield / memory-bank routing + agent persistence + support for private headers ( #92 )
This is yet another of those large PRs (hopefully we will have fewer and fewer of them as things mature). This one introduces substantial improvements and some simplifications to the stack.
Most important bits:
* The Agents reference implementation now supports session / turn persistence. The default implementation uses sqlite, but there's also support for using Redis.
* We have re-architected the structure of the Stack APIs to allow for more flexible routing. The motivating use cases are:
- routing model A to ollama and model B to a remote provider like Together
- routing shield A to a local impl and shield B to a remote provider like Bedrock
- routing a vector memory bank to Weaviate while routing a keyvalue memory bank to Redis
* Support for provider-specific parameters to be passed from clients. A client can pass data using the `x_llamastack_provider_data` parameter, which can be type-checked and provided to the Adapter implementations.
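
The routing use cases listed above can be pictured as a per-API lookup table from a routing key to a provider. What follows is only a minimal sketch under stated assumptions, not the stack's actual code: `RoutingTable`, `register`, `resolve`, and the key and provider strings are all hypothetical names chosen for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class RoutingTable:
    """Hypothetical table mapping a routing key (model, shield, or
    memory bank type) to the name of the provider that serves it."""

    routes: dict[str, str] = field(default_factory=dict)

    def register(self, routing_key: str, provider: str) -> None:
        # Later registrations for the same key overwrite earlier ones.
        self.routes[routing_key] = provider

    def resolve(self, routing_key: str) -> str:
        # Fail loudly when no provider was registered for the key.
        try:
            return self.routes[routing_key]
        except KeyError:
            raise ValueError(f"no provider registered for {routing_key!r}") from None


# One table per API, so each API routes independently (assumed layout).
models = RoutingTable()
models.register("model-a", "ollama")
models.register("model-b", "together")

memory_banks = RoutingTable()
memory_banks.register("vector", "weaviate")
memory_banks.register("keyvalue", "redis")
```

Keeping one table per API means a model key and a memory bank key can never collide, at the cost of a little duplication.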
2024-09-23 14:22:22 -07:00
Hardik Shah
8bf8c07eb3
Respect user sent instructions in agent config and add them to system prompt
2024-09-21 16:46:10 -07:00
Xi Yan
06abd7e6c8
update MemoryToolDefinition
2024-09-20 17:51:53 -07:00
Ashwin Bharambe
942cb87a3c
remove apis/stack.py
2024-09-20 09:37:08 -07:00
Hardik Shah
33db4d2e45
ignore config dir
2024-09-20 00:24:49 -07:00
Hardik Shah
7e9e6117e3
do not assume CONDA_PREFIX exists during configuration
2024-09-19 23:39:34 -07:00
Hardik Shah
8fa49593e0
Allow TGI adaptor to have non-standard llama model names ( #84 )
Co-authored-by: Hardik Shah <hjshah@fb.com>
2024-09-19 21:42:15 -07:00
Hardik Shah
42d29f3a5a
Allow TGI adaptor to have non-standard llama model names
2024-09-19 21:37:02 -07:00
Xi Yan
59af1c8fec
fix memory url parsing ( #81 )
2024-09-19 13:35:03 -07:00
Ashwin Bharambe
132f9429b1
Add a test for CLI, but not fully done so disabled
2024-09-19 13:27:07 -07:00
Ashwin Bharambe
8b3ffa33de
Add another test case
2024-09-19 13:02:57 -07:00
Ashwin Bharambe
abb43936ab
Add a test runner and 2 very simple tests for agents
2024-09-19 12:22:48 -07:00
Xi Yan
543222ac39
update inference prompt msg
2024-09-19 12:03:24 -07:00
Xi Yan
a30b919ae1
update inference prompt msg
2024-09-19 12:03:24 -07:00
Ashwin Bharambe
9eb01dd664
Add DOCKER_BINARY / DOCKER_OPTS to all scripts
2024-09-19 10:26:41 -07:00
Xi Yan
ca4b87aa05
fix memory client
2024-09-19 09:29:40 -07:00
Xi Yan
6302a1ee90
fix prompt with name args ( #80 )
2024-09-18 23:48:31 -07:00
Ashwin Bharambe
c63d6cbd08
list(...keys()) so dict_keys does not show up
2024-09-18 23:24:07 -07:00
Xi Yan
880ed37026
Update cli_reference.md
2024-09-18 23:05:24 -07:00
Xi Yan
5c4a2dc0e1
Update getting_started.md
2024-09-18 23:03:14 -07:00
Ashwin Bharambe
f5eda1decf
Add default for max_seq_len
2024-09-18 21:59:10 -07:00
Ashwin Bharambe
9ab27e852b
Bug fixes for memory
2024-09-18 21:54:02 -07:00
Ashwin Bharambe
8cdc2f0cfb
No RunShieldRequest
2024-09-18 20:38:21 -07:00
Xi Yan
f3f5873e9e
regenerate openapi spec
2024-09-18 19:28:05 -07:00
Xi Yan
9f1be108ce
Bump version to 0.0.20
2024-09-18 19:06:07 -07:00
Xi Yan
455a6e4bb9
update MANIFEST
2024-09-18 18:58:50 -07:00
Ashwin Bharambe
dff9eab48f
Remove "APIs to serve" prompt
2024-09-18 18:26:26 -07:00
Xi Yan
f5d5e32d62
fix docker configure
2024-09-18 17:23:37 -07:00
Xi Yan
5ec64ac68c
moving rfc->docs
2024-09-18 16:54:24 -07:00
Xi Yan
2c1ad10710
move openapi from rfcs->docs
2024-09-18 16:09:17 -07:00
Xi Yan
21058be0c1
Bump version to 0.0.19
2024-09-18 15:48:38 -07:00
Xi Yan
45e20ff431
update getting started
2024-09-18 15:40:48 -07:00
Xi Yan
2f9e952813
update getting started guide
2024-09-18 15:35:54 -07:00
Hardik Shah
29ce73ff7a
update requirements, added prompt-toolkit
2024-09-18 15:21:45 -07:00
Xi Yan
1128f69674
CLI: add build templates support, move imports ( #77 )
* list templates implementation
* relative path
* finalize templates
* remove imports
* remove templates from name, name templates
* fix docker
* fix docker
2024-09-18 14:25:53 -07:00