Commit graph

36 commits

Author SHA1 Message Date
Xi Yan
ddebf9b6e7
[api_updates_3] fix CLI for routing_table, bug fixes for memory & safety (#90)
* fix llama stack build

* fix configure

* fix configure for simple case

* configure w/ routing

* move examples config

* fix memory router naming

* issue w/ safety

* fix config w/ safety

* update memory endpoints

* allow providers in api_providers

* configure script works

* all endpoints w/ build->configure->run simple local works

* new example run.yaml

* run openapi generator
2024-09-23 08:46:33 -07:00
Ashwin Bharambe
bc394882a0 bug fix with routing tables 2024-09-22 17:28:45 -07:00
Ashwin Bharambe
59f1fe5af8 Agent persistence works 2024-09-22 16:34:46 -07:00
Ashwin Bharambe
4eb0f30891 initial cut at using kvstores for agent persistence 2024-09-22 16:34:44 -07:00
Ashwin Bharambe
61974e337f kvstore impls for redis / sqlite moved 2024-09-22 16:33:55 -07:00
Ashwin Bharambe
c1ab66f1e6
Further generalize Xi's changes (#88)
* Further generalize Xi's changes

- introduce a slightly more general notion of an AutoRouted provider
- the AutoRouted provider is associated with a RoutingTable provider
- e.g. inference -> models
- Introduced safety -> shields and memory -> memory_banks
  correspondences

* typo

* Basic build and run succeeded
2024-09-22 16:31:18 -07:00
Xi Yan
b5217fe6fc fix configure 2024-09-22 00:19:24 -07:00
Xi Yan
0348f26e00 models endpoint testing 2024-09-22 00:01:35 -07:00
Xi Yan
c0199029e5 supported models wip 2024-09-21 18:37:22 -07:00
Xi Yan
20a4302877 models API 2024-09-21 17:27:19 -07:00
Xi Yan
f0580251a3 memory routers working 2024-09-21 16:40:23 -07:00
Xi Yan
04f480d70c router method wrapper 2024-09-21 15:56:20 -07:00
Xi Yan
951cc9d7b7 router table registration works 2024-09-21 14:26:48 -07:00
Xi Yan
85d927adde skeleton unified routing table, api routers 2024-09-21 13:44:33 -07:00
Xi Yan
2dc14cba2c stage tmp changes 2024-09-21 12:46:02 -07:00
Xi Yan
abe312c092 Revert "example config"
This reverts commit 73399fe905.
2024-09-21 12:46:02 -07:00
Xi Yan
32b9907d69 Revert "add new resolve_impls_with_routing"
This reverts commit 34f0c11001.
2024-09-21 12:46:02 -07:00
Xi Yan
39c27a3d8c Revert "migrate router for memory wip"
This reverts commit 08379f5214.
2024-09-21 12:46:02 -07:00
Xi Yan
665ab1f812 Revert "delete router from providers"
This reverts commit d8fab77a4f.
2024-09-21 12:46:02 -07:00
Xi Yan
ee77431b64 Revert "clean up"
This reverts commit bc4ac2ceb4.
2024-09-21 12:46:01 -07:00
Xi Yan
5f9a7dcdcc Revert "backward compatibility"
This reverts commit 6a95edc806.
2024-09-21 12:46:01 -07:00
Xi Yan
73133fbeda Revert "stage tmp changes"
This reverts commit 164d0e25c7.
2024-09-21 12:46:01 -07:00
Ashwin Bharambe
32beecb20d Add a special header per-client call to parser provider data 2024-09-21 12:46:00 -07:00
Xi Yan
a6be32bc3d stage tmp changes 2024-09-20 15:33:31 -07:00
Xi Yan
308a1d1e46 backward compatibility 2024-09-20 13:47:58 -07:00
Xi Yan
3787408863 clean up 2024-09-20 13:02:10 -07:00
Xi Yan
9c33587003 delete router from providers 2024-09-20 12:21:44 -07:00
Xi Yan
cda61119ce migrate router for memory wip 2024-09-20 12:19:33 -07:00
Xi Yan
7d4135d5fd add new resolve_impls_with_routing 2024-09-20 11:36:55 -07:00
Xi Yan
9bb6ce54ff example config 2024-09-20 11:22:58 -07:00
Ashwin Bharambe
abb43936ab Add a test runner and 2 very simple tests for agents 2024-09-19 12:22:48 -07:00
Ashwin Bharambe
9eb01dd664 Add DOCKER_BINARY / DOCKER_OPTS to all scripts 2024-09-19 10:26:41 -07:00
Ashwin Bharambe
dff9eab48f Remove "APIs to serve" prompt 2024-09-18 18:26:26 -07:00
Xi Yan
1128f69674
CLI: add build templates support, move imports (#77)
* list templates implementation

* relative path

* finalize templates

* remove imports

* remove templates from name, name templates

* fix docker

* fix docker
2024-09-18 14:25:53 -07:00
Ashwin Bharambe
055770a791 Stop asking for "apis to serve" as part of configure 2024-09-17 22:41:10 -07:00
Ashwin Bharambe
9487ad8294
API Updates (#73)
* API Keys passed from Client instead of distro configuration

* delete distribution registry

* Rename the "package" word away

* Introduce a "Router" layer for providers

Some providers need to be factorized and considered as thin routing
layers on top of other providers. Consider two examples:

- The inference API should be a routing layer over inference providers,
  routed using the "model" key
- The memory banks API is another instance where various memory bank
  types will be provided by independent providers (e.g., a vector store
  is served by Chroma while a keyvalue memory can be served by Redis or
  PGVector)

This commit introduces a generalized routing layer for this purpose.

* update `apis_to_serve`

* llama_toolchain -> llama_stack

* Codemod from llama_toolchain -> llama_stack

- added providers/registry
- cleaned up api/ subdirectories and moved impls away
- restructured api/api.py
- from llama_stack.apis.<api> import foo should work now
- update imports to do llama_stack.apis.<api>
- update many other imports
- added __init__, fixed some registry imports
- updated registry imports
- create_agentic_system -> create_agent
- AgenticSystem -> Agent

* Moved some stuff out of common/; re-generated OpenAPI spec

* llama-toolchain -> llama-stack (hyphens)

* add control plane API

* add redis adapter + sqlite provider

* move core -> distribution

* Some more toolchain -> stack changes

* small naming shenanigans

* Removing custom tool and agent utilities and moving them client side

* Move control plane to distribution server for now

* Remove control plane from API list

* no codeshield dependency randomly plzzzzz

* Add "fire" as a dependency

* add back event loggers

* stack configure fixes

* use brave instead of bing in the example client

* add init file so it gets packaged

* add init files so it gets packaged

* Update MANIFEST

* bug fix

---------

Co-authored-by: Hardik Shah <hjshah@fb.com>
Co-authored-by: Xi Yan <xiyan@meta.com>
Co-authored-by: Ashwin Bharambe <ashwin@meta.com>
2024-09-17 19:51:35 -07:00