mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-07 10:50:56 +00:00

History

Ben Browning 8bee2954be feat: Structured output for Responses API (#2324 ) # What does this PR do? This adds the missing `text` parameter to the Responses API that is how users control structured outputs. All we do with that parameter is map it to the corresponding chat completion response_format. ## Test Plan The new unit tests exercise the various permutations allowed for this property, while a couple of new verification tests actually use it for real to verify the model outputs are following the format as expected. Unit tests: `python -m pytest -s -v tests/unit/providers/agents/meta_reference/test_openai_responses.py` Verification tests: ``` llama stack run llama_stack/templates/together/run.yaml pytest -s -vv 'tests/verifications/openai_api/test_responses.py' \ --base-url=http://localhost:8321/v1/openai/v1 \ --model meta-llama/Llama-4-Scout-17B-16E-Instruct ``` Note that the verification tests can only be run with a real Llama Stack server (as opposed to using the library client via `--provider=stack:together`) because the Llama Stack python client is not yet updated to accept this text field. Signed-off-by: Ben Browning <bbrownin@redhat.com>		2025-06-03 14:43:00 -07:00
..
_static	feat: Structured output for Responses API (#2324 )	2025-06-03 14:43:00 -07:00
notebooks	docs: fix evals notebook preview (#2277 )	2025-05-27 15:18:20 +02:00
openapi_generator	feat: openai files api (#2321 )	2025-06-02 11:45:53 -07:00
resources	Several documentation fixes and fix link to API reference	2025-02-04 14:00:43 -08:00
source	docs: Add missing dependencies in quickstart demo command (#2347 )	2025-06-03 18:01:36 +02:00
zero_to_hero_guide	feat: add additional logging to llama stack build (#1689 )	2025-04-30 11:06:24 -07:00
conftest.py	fix: sleep after notebook test	2025-03-23 14:03:35 -07:00
contbuild.sh	Fix broken links with docs	2024-11-22 20:42:17 -08:00
dog.jpg	Support for Llama3.2 models and Swift SDK (#98 )	2024-09-25 10:29:58 -07:00
getting_started.ipynb	chore: remove last instances of code-interpreter provider (#2143 )	2025-05-12 10:54:43 -07:00
getting_started_llama4.ipynb	docs: llama4 getting started nb (#1878 )	2025-04-06 18:51:34 -07:00
getting_started_llama_api.ipynb	feat: add api.llama provider, llama-guard-4 model (#2058 )	2025-04-29 10:07:41 -07:00
license_header.txt	Initial commit	2024-07-23 08:32:33 -07:00
make.bat	feat(pre-commit): enhance pre-commit hooks with additional checks (#2014 )	2025-04-30 11:35:49 -07:00
Makefile	first version of readthedocs (#278 )	2024-10-22 10:15:58 +05:30
readme.md	chore: use groups when running commands (#2298 )	2025-05-28 09:13:16 -07:00

readme.md

Llama Stack Documentation

Here's a collection of comprehensive guides, examples, and resources for building AI applications with Llama Stack. For the complete documentation, visit our ReadTheDocs page.

Render locally

From the llama-stack root directory, run the following command to render the docs locally:

uv run --group docs sphinx-autobuild docs/source docs/build/html --write-all

You can open up the docs in your browser at http://localhost:8000

Content

Try out Llama Stack's capabilities through our detailed Jupyter notebooks:

Building AI Applications Notebook - A comprehensive guide to building production-ready AI applications using Llama Stack
Benchmark Evaluations Notebook - Detailed performance evaluations and benchmarking results
Zero-to-Hero Guide - Step-by-step guide for getting started with Llama Stack