llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-06-27 18:50:41 +00:00

Author	SHA1	Message	Date
Justin Lee	9928405e2c	Docs improvement v3 (#433 ) # What does this PR do? - updated the notebooks to reflect past changes up to llama-stack 0.0.53 - updated readme to provide accurate and up-to-date info - improve the current zero to hero by integrating an example using together api ## Before submitting - [x] This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case). - [x] Ran pre-commit to handle lint / formatting issues. - [x] Read the [contributor guideline](https://github.com/meta-llama/llama-stack/blob/main/CONTRIBUTING.md), Pull Request section? - [ ] Updated relevant documentation. - [ ] Wrote necessary unit or integration tests. --------- Co-authored-by: Sanyam Bhutani <sanyambhutani@meta.com>	2024-11-22 15:43:31 -08:00
Ashwin Bharambe	55c55b9f51	Update Quick Start significantly	2024-11-21 13:20:55 -08:00
Riandy	2108a779f2	Update kotlin client docs (#476 ) # What does this PR do? In short, provide a summary of what this PR does and why. Usually, the relevant context should be present in a linked issue. Add Kotlin package link into readme docs	2024-11-19 08:43:20 +05:30
Jeff Tang	15dee2b8b8	Added link to the Colab notebook of the Llama Stack lesson on the Llama 3.2 course on DLAI (#445 ) # What does this PR do? It shows a complete zero-setup Colab using the Llama Stack server implemented and powered by together.ai: using Llama Stack Client API to run inference, agent and 3.2 models. Good for a quick start guide. - [ ] Addresses issue (#issue) ## Test Plan Please describe: - tests you ran to verify your changes with result summaries. - provide instructions so it can be reproduced. ## Sources Please link relevant resources if necessary. ## Before submitting - [ ] This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case). - [ ] Ran pre-commit to handle lint / formatting issues. - [ ] Read the [contributor guideline](https://github.com/meta-llama/llama-stack/blob/main/CONTRIBUTING.md), Pull Request section? - [ ] Updated relevant documentation. - [ ] Wrote necessary unit or integration tests.	2024-11-13 13:59:41 -08:00
Xi Yan	c810a4184d	[docs] update documentations (#356 ) * move docs -> source * Add files via upload * mv image * Add files via upload * colocate iOS setup doc * delete image * Add files via upload * fix * delete image * Add files via upload * Update developer_cookbook.md * toctree * wip subfolder * docs update * subfolder * updates * name * updates * index * updates * refactor structure * depth * docs * content * docs * getting started * distributions * fireworks * fireworks * update * theme * theme * theme * pdj theme * pytorch theme * css * theme * agents example * format * index * headers * copy button * test tabs * test tabs * fix * tabs * tab * tabs * sphinx_design * quick start commands * size * width * css * css * download models * asthetic fix * tab format * update * css * width * css * docs * tab based * tab * tabs * docs * style * image * css * color * typo * update docs * missing links * list templates * links * links update * troubleshooting * fix * distributions * docs * fix table * kill llamastack-local-gpu/cpu * Update index.md * Update index.md * mv ios_setup.md * Update ios_setup.md * Add remote_or_local.gif * Update ios_setup.md * release notes * typos * Add ios_setup to index * nav bar * hide torctree * ios image * links update * rename * rename * docs * rename * links * distributions * distributions * distributions * distributions * remove release * remote --------- Co-authored-by: dltn <6599399+dltn@users.noreply.github.com> Co-authored-by: Ashwin Bharambe <ashwin.bharambe@gmail.com>	2024-11-04 16:52:38 -08:00
Justin Lee	b6d8246b82	added templates and enhanced readme (#307 ) Co-authored-by: Justin Lee <justinai@fb.com>	2024-10-24 17:07:06 -07:00
Xi Yan	cb203b14b4	update README.md	2024-10-21 13:51:39 -07:00
Xi Yan	d787d1e84f	config templates restructure, docs (#262 ) * wip * config templates * readmes	2024-10-16 23:25:10 -07:00
Dalton Flanagan	9fbe8852aa	Add Swift Package Index badge	2024-10-10 23:39:25 -04:00
Dalton Flanagan	a3e65d58a9	Add logo	2024-10-10 15:04:21 -04:00
Xi Yan	16ba0fa06f	Update README.md	2024-10-07 11:24:27 -07:00
Russell Bryant	996efa9b42	README.md: Add vLLM to providers table (#207 ) Signed-off-by: Russell Bryant <russell.bryant@gmail.com>	2024-10-07 10:26:52 -07:00
Xi Yan	2366e18873	refactor docs (#209 )	2024-10-07 10:21:26 -07:00
raghotham	d74501f75c	Update README.md Added pypi package version	2024-10-03 10:21:16 -07:00
Xi Yan	f6a6598d1a	[bugfix] fix #146 (#147 ) * more robust image type * lint	2024-09-28 17:47:00 -07:00
Xi Yan	b646167d94	Update README.md	2024-09-28 16:55:22 -07:00
Xi Yan	5ce759adc4	Update README.md	2024-09-28 16:55:08 -07:00
JC (Jonathan Chen)	e73e9110b7	docs: fix typo (#107 )	2024-09-25 18:36:31 -07:00
Xi Yan	d0280138ef	Update README.md	2024-09-25 17:29:17 -07:00
machina-source	37be3fb184	Fix links & format (#104 ) Fix broken examples link to llama-stack-apps repo Remove extra space in README.md	2024-09-25 14:18:46 -07:00
raghotham	baf7bb47b9	Update README.md	2024-09-25 11:45:47 -07:00
Ashwin Bharambe	8d511cdf91	Make build_conda_env a bit more robust	2024-09-24 10:12:07 -07:00
Ashwin Bharambe	9487ad8294	API Updates (#73 ) * API Keys passed from Client instead of distro configuration * delete distribution registry * Rename the "package" word away * Introduce a "Router" layer for providers Some providers need to be factorized and considered as thin routing layers on top of other providers. Consider two examples: - The inference API should be a routing layer over inference providers, routed using the "model" key - The memory banks API is another instance where various memory bank types will be provided by independent providers (e.g., a vector store is served by Chroma while a keyvalue memory can be served by Redis or PGVector) This commit introduces a generalized routing layer for this purpose. * update `apis_to_serve` * llama_toolchain -> llama_stack * Codemod from llama_toolchain -> llama_stack - added providers/registry - cleaned up api/ subdirectories and moved impls away - restructured api/api.py - from llama_stack.apis.<api> import foo should work now - update imports to do llama_stack.apis.<api> - update many other imports - added __init__, fixed some registry imports - updated registry imports - create_agentic_system -> create_agent - AgenticSystem -> Agent * Moved some stuff out of common/; re-generated OpenAPI spec * llama-toolchain -> llama-stack (hyphens) * add control plane API * add redis adapter + sqlite provider * move core -> distribution * Some more toolchain -> stack changes * small naming shenanigans * Removing custom tool and agent utilities and moving them client side * Move control plane to distribution server for now * Remove control plane from API list * no codeshield dependency randomly plzzzzz * Add "fire" as a dependency * add back event loggers * stack configure fixes * use brave instead of bing in the example client * add init file so it gets packaged * add init files so it gets packaged * Update MANIFEST * bug fix --------- Co-authored-by: Hardik Shah <hjshah@fb.com> Co-authored-by: Xi Yan <xiyan@meta.com> Co-authored-by: Ashwin Bharambe <ashwin@meta.com>	2024-09-17 19:51:35 -07:00
raghotham	d8b3fdbd54	Update README.md	2024-09-13 08:56:47 -07:00
Hardik Shah	7f13853e5e	Update README.md	2024-08-12 17:10:02 -07:00
Dalton Flanagan	416097a9ea	Rename inline -> local (#24 ) * Rename the "inline" distribution to "local" * further rename --------- Co-authored-by: Ashwin Bharambe <ashwin.bharambe@gmail.com>	2024-08-08 17:39:03 -04:00
Ashwin Bharambe	e830814399	Introduce Llama stack distributions (#22 ) * Add distribution CLI scaffolding * More progress towards `llama distribution install` * getting closer to a distro definition, distro install + configure works * Distribution server now functioning * read existing configuration, save enums properly * Remove inference uvicorn server entrypoint and llama inference CLI command * updated dependency and client model name * Improved exception handling * local imports for faster cli * undo a typo, add a passthrough distribution * implement full-passthrough in the server * add safety adapters, configuration handling, server + clients * cleanup, moving stuff to common, nuke utils * Add a Path() wrapper at the earliest place * fixes * Bring agentic system api to toolchain Add adapter dependencies and resolve adapters using a topological sort * refactor to reduce size of `agentic_system` * move straggler files and fix some important existing bugs * ApiSurface -> Api * refactor a method out * Adapter -> Provider * Make each inference provider into its own subdirectory * installation fixes * Rename Distribution -> DistributionSpec, simplify RemoteProviders * dict key instead of attr * update inference config to take model and not model_dir * Fix passthrough streaming, send headers properly not part of body :facepalm * update safety to use model sku ids and not model dirs * Update cli_reference.md * minor fixes * add DistributionConfig, fix a bug in model download * Make install + start scripts do proper configuration automatically * Update CLI_reference * Nuke fp8_requirements, fold fbgemm into common requirements * Update README, add newline between API surface configurations * Refactor download functionality out of the Command so can be reused * Add `llama model download` alias for `llama download` * Show message about checksum file so users can check themselves * Simpler intro statements * get ollama working * Reduce a bunch of dependencies from toolchain Some improvements to the distribution install script * Avoid using `conda run` since it buffers everything * update dependencies and rely on LLAMA_TOOLCHAIN_DIR for dev purposes * add validation for configuration input * resort imports * make optional subclasses default to yes for configuration * Remove additional_pip_packages; move deps to providers * for inline make 8b model the default * Add scripts to MANIFEST * allow installing from test.pypi.org * Fix #2 to help with testing packages * Must install llama-models at that same version first * fix PIP_ARGS --------- Co-authored-by: Hardik Shah <hjshah@fb.com> Co-authored-by: Hardik Shah <hjshah@meta.com>	2024-08-08 13:38:41 -07:00
Dalton Flanagan	dd6c1f1e64	Add links to shields	2024-07-27 11:28:46 -04:00
Dalton Flanagan	b5d7cec11e	Add shields to README	2024-07-27 11:02:50 -04:00
Dalton Flanagan	ec433448f2	Add CLI reference docs (#14 ) * Add CLI reference doc * touchups * add helptext for download	2024-07-25 13:56:29 -07:00
Ashwin Bharambe	f7e053e3ba	Updates to setup and requirements for PyPI	2024-07-23 13:28:30 -07:00
Ashwin Bharambe	5d5acc8ed5	Initial commit	2024-07-23 08:32:33 -07:00

32 commits