mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-27 19:02:01 +00:00

History

Ben Browning fa34468308 feat: File search tool for Responses API This is an initial working prototype of wiring up the `file_search` builtin tool for the Responses API to our existing rag knowledge search tool. I stubbed in a new test (that uses a hardcoded url hybrid of the OpenAI and Llama Stack clients for now, only until we finish landing the vector store APIs and insertion support). Note that this is currently under tests/verification only because it sometimes flakes with tool calling of the small Llama-3.2-3B model we run in CI (and that I use as an example below). We'd want to make the test a bit more robust in some way if we moved this over to tests/integration and ran it in CI. ``` ollama run llama3.2:3b INFERENCE_MODEL="meta-llama/Llama-3.2-3B-Instruct" \ llama stack run ./llama_stack/templates/ollama/run.yaml \ --image-type venv \ --env OLLAMA_URL="http://0.0.0.0:11434" pytest -sv 'tests/verifications/openai_api/test_responses.py::test_response_non_streaming_file_search' \ --base-url=http://localhost:8321/v1/openai/v1 \ --model meta-llama/Llama-3.2-3B-Instruct ``` Signed-off-by: Ben Browning <bbrownin@redhat.com>		2025-06-13 09:36:04 -04:00
..
_static	feat: File search tool for Responses API	2025-06-13 09:36:04 -04:00
notebooks	docs: fix evals notebook preview (#2277 )	2025-05-27 15:18:20 +02:00
openapi_generator	feat: openai files api (#2321 )	2025-06-02 11:45:53 -07:00
resources	Several documentation fixes and fix link to API reference	2025-02-04 14:00:43 -08:00
source	feat(auth): allow token to be provided for use against jwks endpoint (#2394 )	2025-06-13 10:13:41 +02:00
zero_to_hero_guide	feat: add additional logging to llama stack build (#1689 )	2025-04-30 11:06:24 -07:00
conftest.py	fix: sleep after notebook test	2025-03-23 14:03:35 -07:00
contbuild.sh	Fix broken links with docs	2024-11-22 20:42:17 -08:00
dog.jpg	Support for Llama3.2 models and Swift SDK (#98 )	2024-09-25 10:29:58 -07:00
getting_started.ipynb	chore: remove last instances of code-interpreter provider (#2143 )	2025-05-12 10:54:43 -07:00
getting_started_llama4.ipynb	docs: llama4 getting started nb (#1878 )	2025-04-06 18:51:34 -07:00
getting_started_llama_api.ipynb	feat: add api.llama provider, llama-guard-4 model (#2058 )	2025-04-29 10:07:41 -07:00
license_header.txt	Initial commit	2024-07-23 08:32:33 -07:00
make.bat	feat(pre-commit): enhance pre-commit hooks with additional checks (#2014 )	2025-04-30 11:35:49 -07:00
Makefile	first version of readthedocs (#278 )	2024-10-22 10:15:58 +05:30
readme.md	chore: use groups when running commands (#2298 )	2025-05-28 09:13:16 -07:00

readme.md

Llama Stack Documentation

Here's a collection of comprehensive guides, examples, and resources for building AI applications with Llama Stack. For the complete documentation, visit our ReadTheDocs page.

Render locally

From the llama-stack root directory, run the following command to render the docs locally:

uv run --group docs sphinx-autobuild docs/source docs/build/html --write-all

You can open up the docs in your browser at http://localhost:8000

Content

Try out Llama Stack's capabilities through our detailed Jupyter notebooks:

Building AI Applications Notebook - A comprehensive guide to building production-ready AI applications using Llama Stack
Benchmark Evaluations Notebook - Detailed performance evaluations and benchmarking results
Zero-to-Hero Guide - Step-by-step guide for getting started with Llama Stack