llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-06-27 18:50:41 +00:00

Author	SHA1	Message	Date
Xi Yan	06abd7e6c8	update MemoryToolDefinition	2024-09-20 17:51:53 -07:00
Ashwin Bharambe	942cb87a3c	remove apis/stack.py	2024-09-20 09:37:08 -07:00
Hardik Shah	7e9e6117e3	do not assume CONDA_PREFIX exists during configuration	2024-09-19 23:39:34 -07:00
Hardik Shah	8fa49593e0	Allow TGI adaptor to have non-standard llama model names (#84) Co-authored-by: Hardik Shah <hjshah@fb.com>	2024-09-19 21:42:15 -07:00
Hardik Shah	42d29f3a5a	Allow TGI adaptor to have non-standard llama model names	2024-09-19 21:37:02 -07:00
Xi Yan	59af1c8fec	fix memory url parsing (#81)	2024-09-19 13:35:03 -07:00
Ashwin Bharambe	132f9429b1	Add a test for CLI, but not fully done so disabled	2024-09-19 13:27:07 -07:00
Ashwin Bharambe	8b3ffa33de	Add another test case	2024-09-19 13:02:57 -07:00
Ashwin Bharambe	abb43936ab	Add a test runner and 2 very simple tests for agents	2024-09-19 12:22:48 -07:00
Xi Yan	543222ac39	update inference prompt msg	2024-09-19 12:03:24 -07:00
Xi Yan	a30b919ae1	update inference prompt msg	2024-09-19 12:03:24 -07:00
Ashwin Bharambe	9eb01dd664	Add DOCKER_BINARY / DOCKER_OPTS to all scripts	2024-09-19 10:26:41 -07:00
Xi Yan	ca4b87aa05	fix memory client	2024-09-19 09:29:40 -07:00
Xi Yan	6302a1ee90	fix prompt with name args (#80)	2024-09-18 23:48:31 -07:00
Ashwin Bharambe	c63d6cbd08	list(...keys()) so dict_keys does not show up	2024-09-18 23:24:07 -07:00
Ashwin Bharambe	f5eda1decf	Add default for max_seq_len	2024-09-18 21:59:10 -07:00
Ashwin Bharambe	9ab27e852b	Bug fixes for memory	2024-09-18 21:54:02 -07:00
Ashwin Bharambe	8cdc2f0cfb	No RunShieldRequest	2024-09-18 20:38:21 -07:00
Ashwin Bharambe	dff9eab48f	Remove "APIs to serve" prompt	2024-09-18 18:26:26 -07:00
Xi Yan	f5d5e32d62	fix docker configure	2024-09-18 17:23:37 -07:00
Xi Yan	1128f69674	CLI: add build templates support, move imports (#77) * list templates implementation * relative path * finalize templates * remove imports * remove templates from name, name templates * fix docker * fix docker	2024-09-18 14:25:53 -07:00
Xi Yan	6b21523c28	CLI - add back build wizard, configure with name instead of build.yaml (#74) * add back wizard for build * conda build path move * polish message * run with name only * prompt for build * improve comments * update msgs * add new lines * move build.yaml * address comments * validator for providers * move imports * Please enter -> enter * comments, get started guide * nits * fix cprint import * fix imports	2024-09-18 11:41:56 -07:00
Xi Yan	e6fdb9df29	fix context retriever (#75)	2024-09-18 08:24:36 -07:00
Ashwin Bharambe	055770a791	Stop asking for "apis to serve" as part of configure	2024-09-17 22:41:10 -07:00
Ashwin Bharambe	9fd431e710	make shield imports more lazy	2024-09-17 21:27:37 -07:00
Ashwin Bharambe	3e27131a69	Don't import `pkg_resources` until you need it	2024-09-17 20:01:22 -07:00
Ashwin Bharambe	25adc83de8	Fix for safety	2024-09-17 19:56:58 -07:00
Ashwin Bharambe	9487ad8294	API Updates (#73) * API Keys passed from Client instead of distro configuration * delete distribution registry * Rename the "package" word away * Introduce a "Router" layer for providers Some providers need to be factorized and considered as thin routing layers on top of other providers. Consider two examples: - The inference API should be a routing layer over inference providers, routed using the "model" key - The memory banks API is another instance where various memory bank types will be provided by independent providers (e.g., a vector store is served by Chroma while a keyvalue memory can be served by Redis or PGVector) This commit introduces a generalized routing layer for this purpose. * update `apis_to_serve` * llama_toolchain -> llama_stack * Codemod from llama_toolchain -> llama_stack - added providers/registry - cleaned up api/ subdirectories and moved impls away - restructured api/api.py - from llama_stack.apis.<api> import foo should work now - update imports to do llama_stack.apis.<api> - update many other imports - added __init__, fixed some registry imports - updated registry imports - create_agentic_system -> create_agent - AgenticSystem -> Agent * Moved some stuff out of common/; re-generated OpenAPI spec * llama-toolchain -> llama-stack (hyphens) * add control plane API * add redis adapter + sqlite provider * move core -> distribution * Some more toolchain -> stack changes * small naming shenanigans * Removing custom tool and agent utilities and moving them client side * Move control plane to distribution server for now * Remove control plane from API list * no codeshield dependency randomly plzzzzz * Add "fire" as a dependency * add back event loggers * stack configure fixes * use brave instead of bing in the example client * add init file so it gets packaged * add init files so it gets packaged * Update MANIFEST * bug fix --------- Co-authored-by: Hardik Shah <hjshah@fb.com> Co-authored-by: Xi Yan <xiyan@meta.com> Co-authored-by: Ashwin Bharambe <ashwin@meta.com>	2024-09-17 19:51:35 -07:00
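
The "Router" layer described in commit 9487ad8294 (a thin routing layer over inference providers, keyed on "model") could look roughly like the sketch below. This is only an illustration of the idea, not code from the repository: the names `InferenceProvider`, `EchoProvider`, `InferenceRouter`, and `routing_table`, as well as the model and provider labels, are all hypothetical.

```python
from typing import Dict, Protocol


class InferenceProvider(Protocol):
    """Hypothetical interface that every inference provider is assumed to share."""

    def completion(self, model: str, prompt: str) -> str: ...


class EchoProvider:
    """Hypothetical stand-in provider; it just tags the prompt with its own name."""

    def __init__(self, name: str) -> None:
        self.name = name

    def completion(self, model: str, prompt: str) -> str:
        return f"[{self.name}:{model}] {prompt}"


class InferenceRouter:
    """Exposes the same interface as a provider, but only dispatches on `model`."""

    def __init__(self, routing_table: Dict[str, InferenceProvider]) -> None:
        # Maps a model key to the provider that serves it (assumed layout).
        self.routing_table = routing_table

    def completion(self, model: str, prompt: str) -> str:
        try:
            provider = self.routing_table[model]
        except KeyError:
            raise ValueError(f"no provider registered for model {model!r}") from None
        return provider.completion(model, prompt)


# Two placeholder models served by two different (hypothetical) providers.
router = InferenceRouter(
    {
        "model-a": EchoProvider("tgi"),
        "model-b": EchoProvider("local"),
    }
)
print(router.completion("model-a", "hello"))
```

Because the router has the same call shape as a provider, callers do not need to know which backend serves a given model; a memory-bank router could presumably follow the same pattern with the bank type as the key.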


1278 commits