llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-07-21 03:59:42 +00:00

Author	SHA1	Message	Date
IAN MILLER	fa1bb9ae00	docs: fix typo and link self loop for index.html#running-tests (#2777 ) # What does this PR do? <!-- Provide a short summary of what this PR does and why. Link to relevant issues if applicable. --> This PR fixes typo "here here" and self loop link at [https://llama-stack.readthedocs.io/en/latest/contributing/index.html#tests/README.md](https://llama-stack.readthedocs.io/en/latest/contributing/index.html#tests/README.md) <!-- If resolving an issue, uncomment and update the line below --> <!-- Closes #[issue-number] --> Closes #2762 ## Test Plan <!-- Describe the tests you ran to verify your changes with result summaries. Provide clear instructions so the plan can be easily re-executed. -->	2025-07-16 07:09:44 -07:00
Nathan Weinberg	0bbff91c7e	docs: fix a few broken things in the CONTRIBUTING.md (#2714 ) # What does this PR do? "dev" dependencies were moved in pyproject.toml typo with guidance around automatic doc generation Signed-off-by: Nathan Weinberg <nweinber@redhat.com>	2025-07-10 11:47:54 -07:00
Sébastien Han	c9a49a80e8	docs: auto generated documentation for providers (#2543 ) # What does this PR do? Simple approach to get some provider pages in the docs. Add or update description fields in the provider configuration class using Pydantic’s Field, ensuring these descriptions are clear and complete, as they will be used to auto-generate provider documentation via ./scripts/distro_codegen.py instead of editing the docs manually. Signed-off-by: Sébastien Han <seb@redhat.com>	2025-06-30 15:13:20 +02:00
Sébastien Han	9c8be89fb6	chore: bump python supported version to 3.12 (#2475 ) Some checks failed Integration Tests / test-matrix (library, 3.12, providers) (push) Failing after 7s Details Integration Tests / test-matrix (library, 3.12, inspect) (push) Failing after 12s Details Integration Tests / test-matrix (library, 3.12, datasets) (push) Failing after 16s Details Test Llama Stack Build / build-single-provider (push) Failing after 9s Details Integration Tests / test-matrix (library, 3.13, inference) (push) Failing after 10s Details Integration Tests / test-matrix (library, 3.12, agents) (push) Failing after 7s Details Python Package Build Test / build (3.13) (push) Failing after 5s Details Test Llama Stack Build / build-ubi9-container-distribution (push) Failing after 7s Details Integration Tests / test-matrix (http, 3.13, datasets) (push) Failing after 14s Details Integration Tests / test-matrix (library, 3.12, tool_runtime) (push) Failing after 15s Details Integration Tests / test-matrix (library, 3.13, agents) (push) Failing after 14s Details Integration Tests / test-matrix (library, 3.13, datasets) (push) Failing after 11s Details Integration Tests / test-matrix (library, 3.13, vector_io) (push) Failing after 10s Details Integration Tests / test-matrix (library, 3.13, scoring) (push) Failing after 11s Details Integration Tests / test-matrix (library, 3.13, post_training) (push) Failing after 12s Details Integration Tests / test-matrix (library, 3.12, inference) (push) Failing after 12s Details Integration Tests / test-matrix (http, 3.13, providers) (push) Failing after 13s Details Integration Tests / test-matrix (library, 3.12, vector_io) (push) Failing after 14s Details Integration Tests / test-matrix (library, 3.13, tool_runtime) (push) Failing after 7s Details Integration Tests / test-matrix (library, 3.12, post_training) (push) Failing after 11s Details Unit Tests / unit-tests (3.12) (push) Failing after 7s Details Integration Tests / test-matrix (library, 3.13, inspect) (push) Failing after 6s Details Update ReadTheDocs / update-readthedocs (push) Failing after 5s Details Unit Tests / unit-tests (3.13) (push) Failing after 8s Details Test Llama Stack Build / build (push) Failing after 6s Details Integration Tests / test-matrix (library, 3.13, providers) (push) Failing after 41s Details Python Package Build Test / build (3.12) (push) Failing after 33s Details Test Llama Stack Build / build-custom-container-distribution (push) Failing after 36s Details Test External Providers / test-external-providers (venv) (push) Failing after 31s Details Pre-commit / pre-commit (push) Successful in 1m54s Details # What does this PR do? The project now supports Python >= 3.12 Signed-off-by: Sébastien Han <seb@redhat.com>	2025-06-24 09:22:04 +05:30
Nathan Weinberg	179d72615b	docs: update contributing guidance around uv python versions (#2398 ) As discussed with @leseb here: https://github.com/containers/ramalama-stack/pull/81#discussion_r2125961014 Signed-off-by: Nathan Weinberg <nweinber@redhat.com>	2025-06-04 23:12:03 -07:00
Sébastien Han	6352078e4b	chore: use groups when running commands (#2298 ) # What does this PR do? Followup of https://github.com/meta-llama/llama-stack/pull/2287. We must use `--group` when running commands with uv. <!-- Provide a short summary of what this PR does and why. Link to relevant issues if applicable. --> <!-- If resolving an issue, uncomment and update the line below --> <!-- Closes #[issue-number] --> Signed-off-by: Sébastien Han <seb@redhat.com>	2025-05-28 09:13:16 -07:00
Sébastien Han	85b5f3172b	docs: misc cleanup (#2223 ) # What does this PR do? * remove requirements.txt to use pyproject.toml as the source of truth * update relevant docs Signed-off-by: Sébastien Han <seb@redhat.com>	2025-05-21 17:35:27 +02:00
Nathan Weinberg	e0d10dd0b1	docs: revamp testing documentation (#2155 ) # What does this PR do? reduces duplication and centralizes information to be easier to find for contributors Signed-off-by: Nathan Weinberg <nweinber@redhat.com>	2025-05-13 11:28:29 -07:00
Derek Higgins	6f1badc934	test: Document how users can run a subset of tests (#2066 ) ## Test Plan N/A Signed-off-by: Derek Higgins <derekh@redhat.com>	2025-05-07 14:05:36 +02:00
Sébastien Han	7377a5c83e	docs: contrib add a note about unicode in code (#2106 ) # What does this PR do? Don't use unicode characters in the codebase. ASCII-only is preferred for compatibility or readability reasons Signed-off-by: Sébastien Han <seb@redhat.com>	2025-05-06 09:50:30 -07:00
Sébastien Han	a4247ce0a8	docs: expand contribution guidelines for linting exceptions (#2101 ) # What does this PR do? - Clarified best practices for using `# noqa` and `# type: ignore`, requiring justification comments - Improved formatting for readability Signed-off-by: Sébastien Han <seb@redhat.com>	2025-05-05 02:36:30 -07:00
Yuan Tang	7e51a83eac	docs: Add link to integration tests instructions and minor clarification (#1838 ) # What does this PR do? * Added `--text-model` in example command. * Added link to integration tests instruction and a note on specifying models. This is to avoid confusion when all tests are skipped because no model is provided. Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>	2025-03-31 11:37:42 +02:00
Yuan Tang	bdfe7fee92	docs: Add more env vars in dotenv instructions (#1791 ) # What does this PR do? Added more hint on `LLAMA_STACK_CONFIG` and API keys necessary for agent tests.	2025-03-25 20:03:21 -07:00
Sébastien Han	4c14bb7510	docs: fix change dir command (#1752 ) # What does this PR do? We are already in the llama-stack git directory. Signed-off-by: Sébastien Han <seb@redhat.com>	2025-03-21 12:00:09 -07:00
Sébastien Han	636d97207f	docs: propose new contribution guidance (#1750 ) # What does this PR do? Propose new contribution guidance. Signed-off-by: Sébastien Han <seb@redhat.com>	2025-03-21 09:08:02 -07:00
Nathan Weinberg	141b3c14dd	docs: fix broken test path in CONTRIBUTING.md (#1679 ) # What does this PR do? fix broken test path in CONTRIBUTING.md Signed-off-by: Nathan Weinberg <nweinber@redhat.com>	2025-03-18 13:39:46 -07:00
Ihar Hrachyshka	77ca09467f	chore: consolidate scripts under ./scripts directory (#1646 )	2025-03-17 17:56:30 -04:00
Ihar Hrachyshka	bfc79217a8	chore: Add ./scripts/unit-tests.sh (#1515 ) # What does this PR do? Useful for local development. Now you can just trigger the script and not care about specific arguments to pass to run unit tests. [//]: # (If resolving an issue, uncomment and update the line below) [//]: # (Closes #[issue-number]) ## Test Plan ``` $ . ./venv/bin/activate $ ./scripts/run_tests.sh $ echo $? 0 ``` [//]: # (## Documentation) Signed-off-by: Ihar Hrachyshka <ihar.hrachyshka@gmail.com> Co-authored-by: Nathan Weinberg <31703736+nathan-weinberg@users.noreply.github.com>	2025-03-13 20:25:15 -07:00
Nathan Weinberg	d263edbf90	build: remove .python-version (#1513 ) # What does this PR do? the current `.python-version` file forces `uv` to setup the development environment with Python 3.10 this causes an error if a dev system does not have Python 3.10, even though the project officially supports newer versions of Python as well since `uv` can use the `pyproject.toml` to determine python versions, we can safely remove this file from the repo and subsequent git tracking follows up on https://github.com/meta-llama/llama-stack/pull/1172 ## Test Plan N/A --------- Signed-off-by: Nathan Weinberg <nweinber@redhat.com>	2025-03-12 20:08:24 -07:00
Ihar Hrachyshka	04106b94aa	docs: Remove duplicate docs on api docs generator (#1534 ) # What does this PR do? Since #892, we also need to install ruamel. Instead of maintaining the list of script dependencies in multiple places, remove it and assume developers read CONTRIBUTING.md docs. [//]: # (If resolving an issue, uncomment and update the line below) [//]: # (Closes #[issue-number]) ## Test Plan Just docs. [//]: # (## Documentation) Signed-off-by: Ihar Hrachyshka <ihar.hrachyshka@gmail.com>	2025-03-11 10:01:46 -07:00
Ellis Tarn	24a27baf7c	chore: Make README code blocks more easily copy pastable (#1420 ) # What does this PR do? When going through READMEs, I found that I had to keep editing the code blocks since they were prefixed with `$ `. A common pattern is to triple click (highlight all) a block and then copy paste. This minor change will make this easier for folks to follow the READMEs. [//]: # (If resolving an issue, uncomment and update the line below) [//]: # (Closes #[issue-number]) ## Test Plan N/A [//]: # (## Documentation)	2025-03-05 09:11:01 -08:00
Ashwin Bharambe	5736c7d682	refactor: move tests/client-sdk to tests/api (#1376 ) This PR moves the client-sdk tests to the api directory to better reflect their purpose and improve code organization.	2025-03-03 17:28:12 -08:00
Ashwin Bharambe	46b0a404e8	chore: remove straggler references to llama-models (#1345 ) Straggler references cleanup	2025-03-01 14:26:03 -08:00
Yuan Tang	264c2c46db	build: Add dotenv file for running tests with uv (#1251 ) This will be useful for testing instead of having to manually pass them every time. --------- Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>	2025-02-27 16:42:55 -08:00
Yuan Tang	6634864b19	docs: Add missing uv command and clarify website rebuild (#1199 ) # What does this PR do? This fixes the following error: ``` $ make html /bin/sh: line 1: sphinx-build: command not found make: *** [Makefile:20: html] Error 127 ``` Also clarifies that this command only rebuilds the website without watching/refreshes. ## Test Plan New command works. --------- Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>	2025-02-21 11:29:32 -05:00
Yuan Tang	35de423556	docs: Add missing uv command for docs generation in contributing guide (#1197 ) # What does this PR do? ``` make html /bin/sh: line 1: sphinx-build: command not found make: *** [Makefile:20: html] Error 127 ``` ## Test Plan Tested the command `uv run ./docs/openapi_generator/run_openapi_generator.sh` successfully.	2025-02-20 21:05:03 -08:00
Sébastien Han	7504cb16c6	docs: improve API contribution guidelines (#1137 ) # What does this PR do? Clarify when to update documentation, explain `uv sync --extra dev` and OpenAPI generation, and specify where generated docs are stored. Signed-off-by: Sébastien Han <seb@redhat.com> Signed-off-by: Sébastien Han <seb@redhat.com>	2025-02-19 22:14:04 -08:00
Ben Browning	8c01b7f05a	docs: Mention convential commits format in CONTRIBUTING.md (#1075 ) # What does this PR do? This adds a note to ensure pull requests follow the conventional commits format, along with a link to that format, in CONTRIBUTING.md. One of the pull-request checks enforces PR titles that match this format, so it's good to be upfront about this expectation before a new developer opens a PR. Signed-off-by: Ben Browning <bbrownin@redhat.com>	2025-02-13 10:57:30 -05:00
Ihar Hrachyshka	6ad272927d	docs: reflect actual number of spaces for indent (#1052 ) For what I see, it's all 4 spaces (as it should be for pep8[1]). [1] https://peps.python.org/pep-0008/#indentation # What does this PR do? Reflect indent reality.	2025-02-11 14:07:26 -08:00
Sébastien Han	a764b823ee	docs: use uv in CONTRIBUTING guide (#970 ) # What does this PR do? Switch to uv for dependency management and update CONTRIBUTING.md with new setup instructions. Add missing dev dependencies to pyproject.toml and apply minor formatting fixes. Signed-off-by: Sébastien Han <seb@redhat.com> - [ ] Addresses issue (#issue) ## Test Plan Please describe: - tests you ran to verify your changes with result summaries. - provide instructions so it can be reproduced. ## Sources Please link relevant resources if necessary. ## Before submitting - [x] This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case). - [ ] Ran pre-commit to handle lint / formatting issues. - [ ] Read the [contributor guideline](https://github.com/meta-llama/llama-stack/blob/main/CONTRIBUTING.md), Pull Request section? - [ ] Updated relevant documentation. - [ ] Wrote necessary unit or integration tests. Signed-off-by: Sébastien Han <seb@redhat.com>	2025-02-06 10:21:27 -08:00
Ashwin Bharambe	d123e9d3d7	Update docs for RAG and improve CONTRIBUTING.md	2025-01-28 06:09:48 -08:00
Ashwin Bharambe	2118f37350	Doc updates	2025-01-23 21:31:18 -08:00
Ashwin Bharambe	14c75c3f21	Update CONTRIBUTING to include info about pre-commit	2024-11-18 18:17:54 -08:00
Ashwin Bharambe	2a31163178	Auto-generate distro yamls + docs (#468 ) # What does this PR do? Automatically generates - build.yaml - run.yaml - run-with-safety.yaml - parts of markdown docs for the distributions. ## Test Plan At this point, this only updates the YAMLs and the docs. Some testing (especially with ollama and vllm) has been performed but needs to be much more tested.	2024-11-18 14:57:06 -08:00
Xi Yan	8350f2df4c	[docs] refactor remote-hosted distro (#402 ) * move docs * docs	2024-11-07 19:16:38 -08:00
Xi Yan	657de08f04	precommit	2024-11-04 19:01:56 -08:00
Xi Yan	8927da6566	instructions on contributing to readthedocs	2024-11-04 18:58:07 -08:00
Xi Yan	2366e18873	refactor docs (#209 )	2024-10-07 10:21:26 -07:00
Ashwin Bharambe	e830814399	Introduce Llama stack distributions (#22 ) * Add distribution CLI scaffolding * More progress towards `llama distribution install` * getting closer to a distro definition, distro install + configure works * Distribution server now functioning * read existing configuration, save enums properly * Remove inference uvicorn server entrypoint and llama inference CLI command * updated dependency and client model name * Improved exception handling * local imports for faster cli * undo a typo, add a passthrough distribution * implement full-passthrough in the server * add safety adapters, configuration handling, server + clients * cleanup, moving stuff to common, nuke utils * Add a Path() wrapper at the earliest place * fixes * Bring agentic system api to toolchain Add adapter dependencies and resolve adapters using a topological sort * refactor to reduce size of `agentic_system` * move straggler files and fix some important existing bugs * ApiSurface -> Api * refactor a method out * Adapter -> Provider * Make each inference provider into its own subdirectory * installation fixes * Rename Distribution -> DistributionSpec, simplify RemoteProviders * dict key instead of attr * update inference config to take model and not model_dir * Fix passthrough streaming, send headers properly not part of body :facepalm * update safety to use model sku ids and not model dirs * Update cli_reference.md * minor fixes * add DistributionConfig, fix a bug in model download * Make install + start scripts do proper configuration automatically * Update CLI_reference * Nuke fp8_requirements, fold fbgemm into common requirements * Update README, add newline between API surface configurations * Refactor download functionality out of the Command so can be reused * Add `llama model download` alias for `llama download` * Show message about checksum file so users can check themselves * Simpler intro statements * get ollama working * Reduce a bunch of dependencies from toolchain Some improvements to the distribution install script * Avoid using `conda run` since it buffers everything * update dependencies and rely on LLAMA_TOOLCHAIN_DIR for dev purposes * add validation for configuration input * resort imports * make optional subclasses default to yes for configuration * Remove additional_pip_packages; move deps to providers * for inline make 8b model the default * Add scripts to MANIFEST * allow installing from test.pypi.org * Fix #2 to help with testing packages * Must install llama-models at that same version first * fix PIP_ARGS --------- Co-authored-by: Hardik Shah <hjshah@fb.com> Co-authored-by: Hardik Shah <hjshah@meta.com>	2024-08-08 13:38:41 -07:00
Ashwin Bharambe	5d5acc8ed5	Initial commit	2024-07-23 08:32:33 -07:00

40 commits