llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-06-27 18:50:41 +00:00

Author	SHA1	Message	Date
Ashwin Bharambe	46b0a404e8	chore: remove straggler references to llama-models (#1345 ) Straggler references cleanup	2025-03-01 14:26:03 -08:00
Yuan Tang	264c2c46db	build: Add dotenv file for running tests with uv (#1251 ) This will be useful for testing instead of having to manually pass them every time. --------- Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>	2025-02-27 16:42:55 -08:00
Yuan Tang	6634864b19	docs: Add missing uv command and clarify website rebuild (#1199 ) # What does this PR do? This fixes the following error: ``` $ make html /bin/sh: line 1: sphinx-build: command not found make: *** [Makefile:20: html] Error 127 ``` Also clarifies that this command only rebuilds the website without watching/refreshes. ## Test Plan New command works. --------- Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>	2025-02-21 11:29:32 -05:00
Yuan Tang	35de423556	docs: Add missing uv command for docs generation in contributing guide (#1197 ) # What does this PR do? ``` make html /bin/sh: line 1: sphinx-build: command not found make: *** [Makefile:20: html] Error 127 ``` ## Test Plan Tested the command `uv run ./docs/openapi_generator/run_openapi_generator.sh` successfully.	2025-02-20 21:05:03 -08:00
Sébastien Han	7504cb16c6	docs: improve API contribution guidelines (#1137 ) # What does this PR do? Clarify when to update documentation, explain `uv sync --extra dev` and OpenAPI generation, and specify where generated docs are stored. Signed-off-by: Sébastien Han <seb@redhat.com> Signed-off-by: Sébastien Han <seb@redhat.com>	2025-02-19 22:14:04 -08:00
Ben Browning	8c01b7f05a	docs: Mention convential commits format in CONTRIBUTING.md (#1075 ) # What does this PR do? This adds a note to ensure pull requests follow the conventional commits format, along with a link to that format, in CONTRIBUTING.md. One of the pull-request checks enforces PR titles that match this format, so it's good to be upfront about this expectation before a new developer opens a PR. Signed-off-by: Ben Browning <bbrownin@redhat.com>	2025-02-13 10:57:30 -05:00
Ihar Hrachyshka	6ad272927d	docs: reflect actual number of spaces for indent (#1052 ) For what I see, it's all 4 spaces (as it should be for pep8[1]). [1] https://peps.python.org/pep-0008/#indentation # What does this PR do? Reflect indent reality.	2025-02-11 14:07:26 -08:00
Sébastien Han	a764b823ee	docs: use uv in CONTRIBUTING guide (#970 ) # What does this PR do? Switch to uv for dependency management and update CONTRIBUTING.md with new setup instructions. Add missing dev dependencies to pyproject.toml and apply minor formatting fixes. Signed-off-by: Sébastien Han <seb@redhat.com> - [ ] Addresses issue (#issue) ## Test Plan Please describe: - tests you ran to verify your changes with result summaries. - provide instructions so it can be reproduced. ## Sources Please link relevant resources if necessary. ## Before submitting - [x] This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case). - [ ] Ran pre-commit to handle lint / formatting issues. - [ ] Read the [contributor guideline](https://github.com/meta-llama/llama-stack/blob/main/CONTRIBUTING.md), Pull Request section? - [ ] Updated relevant documentation. - [ ] Wrote necessary unit or integration tests. Signed-off-by: Sébastien Han <seb@redhat.com>	2025-02-06 10:21:27 -08:00
Ashwin Bharambe	d123e9d3d7	Update docs for RAG and improve CONTRIBUTING.md	2025-01-28 06:09:48 -08:00
Ashwin Bharambe	2118f37350	Doc updates	2025-01-23 21:31:18 -08:00
Ashwin Bharambe	14c75c3f21	Update CONTRIBUTING to include info about pre-commit	2024-11-18 18:17:54 -08:00
Ashwin Bharambe	2a31163178	Auto-generate distro yamls + docs (#468 ) # What does this PR do? Automatically generates - build.yaml - run.yaml - run-with-safety.yaml - parts of markdown docs for the distributions. ## Test Plan At this point, this only updates the YAMLs and the docs. Some testing (especially with ollama and vllm) has been performed but needs to be much more tested.	2024-11-18 14:57:06 -08:00
Xi Yan	8350f2df4c	[docs] refactor remote-hosted distro (#402 ) * move docs * docs	2024-11-07 19:16:38 -08:00
Xi Yan	657de08f04	precommit	2024-11-04 19:01:56 -08:00
Xi Yan	8927da6566	instructions on contributing to readthedocs	2024-11-04 18:58:07 -08:00
Xi Yan	2366e18873	refactor docs (#209 )	2024-10-07 10:21:26 -07:00
Ashwin Bharambe	e830814399	Introduce Llama stack distributions (#22 ) * Add distribution CLI scaffolding * More progress towards `llama distribution install` * getting closer to a distro definition, distro install + configure works * Distribution server now functioning * read existing configuration, save enums properly * Remove inference uvicorn server entrypoint and llama inference CLI command * updated dependency and client model name * Improved exception handling * local imports for faster cli * undo a typo, add a passthrough distribution * implement full-passthrough in the server * add safety adapters, configuration handling, server + clients * cleanup, moving stuff to common, nuke utils * Add a Path() wrapper at the earliest place * fixes * Bring agentic system api to toolchain Add adapter dependencies and resolve adapters using a topological sort * refactor to reduce size of `agentic_system` * move straggler files and fix some important existing bugs * ApiSurface -> Api * refactor a method out * Adapter -> Provider * Make each inference provider into its own subdirectory * installation fixes * Rename Distribution -> DistributionSpec, simplify RemoteProviders * dict key instead of attr * update inference config to take model and not model_dir * Fix passthrough streaming, send headers properly not part of body :facepalm * update safety to use model sku ids and not model dirs * Update cli_reference.md * minor fixes * add DistributionConfig, fix a bug in model download * Make install + start scripts do proper configuration automatically * Update CLI_reference * Nuke fp8_requirements, fold fbgemm into common requirements * Update README, add newline between API surface configurations * Refactor download functionality out of the Command so can be reused * Add `llama model download` alias for `llama download` * Show message about checksum file so users can check themselves * Simpler intro statements * get ollama working * Reduce a bunch of dependencies from toolchain Some improvements to the distribution install script * Avoid using `conda run` since it buffers everything * update dependencies and rely on LLAMA_TOOLCHAIN_DIR for dev purposes * add validation for configuration input * resort imports * make optional subclasses default to yes for configuration * Remove additional_pip_packages; move deps to providers * for inline make 8b model the default * Add scripts to MANIFEST * allow installing from test.pypi.org * Fix #2 to help with testing packages * Must install llama-models at that same version first * fix PIP_ARGS --------- Co-authored-by: Hardik Shah <hjshah@fb.com> Co-authored-by: Hardik Shah <hjshah@meta.com>	2024-08-08 13:38:41 -07:00
Ashwin Bharambe	5d5acc8ed5	Initial commit	2024-07-23 08:32:33 -07:00

18 commits