mirror of https://github.com/meta-llama/llama-stack.git synced 2025-07-26 22:19:49 +00:00

History

Ashwin Bharambe 1463b79218 feat(registry): make the Stack query providers for model listing (#2862 ) This flips #2823 and #2805 by making the Stack periodically query the providers for models rather than the providers going behind the back and calling "register" on to the registry themselves. This also adds support for model listing for all other providers via `ModelRegistryHelper`. Once this is done, we do not need to manually list or register models via `run.yaml` and it will remove both noise and annoyance (setting `INFERENCE_MODEL` environment variables, for example) from the new user experience. In addition, it adds a configuration variable `allowed_models` which can be used to optionally restrict the set of models exposed from a provider.		2025-07-24 10:39:53 -07:00
..
_static	chore: Making name optional in openai_create_vector_store (#2858 )	2025-07-22 13:31:31 -04:00
notebooks	feat: Add Nvidia e2e beginner notebook and tool calling notebook (#1964 )	2025-06-16 11:29:01 -04:00
openapi_generator	feat: Add webmethod for deleting openai responses (#2160 )	2025-06-30 11:28:02 +02:00
resources	Several documentation fixes and fix link to API reference	2025-02-04 14:00:43 -08:00
source	feat(registry): make the Stack query providers for model listing (#2862 )	2025-07-24 10:39:53 -07:00
zero_to_hero_guide	feat: consolidate most distros into "starter" (#2516 )	2025-07-04 15:58:03 +02:00
conftest.py	fix: sleep after notebook test	2025-03-23 14:03:35 -07:00
contbuild.sh	Fix broken links with docs	2024-11-22 20:42:17 -08:00
dog.jpg	Support for Llama3.2 models and Swift SDK (#98 )	2024-09-25 10:29:58 -07:00
getting_started.ipynb	docs: Add quick_start.ipynb notebook equivalent of index.md Quickstart guide (#2128 )	2025-07-03 13:55:43 +02:00
getting_started_llama4.ipynb	docs: update docs to use "starter" than "ollama" (#2629 )	2025-07-05 08:44:57 +05:30
getting_started_llama_api.ipynb	docs: Add quick_start.ipynb notebook equivalent of index.md Quickstart guide (#2128 )	2025-07-03 13:55:43 +02:00
license_header.txt	Initial commit	2024-07-23 08:32:33 -07:00
make.bat	feat(pre-commit): enhance pre-commit hooks with additional checks (#2014 )	2025-04-30 11:35:49 -07:00
Makefile	first version of readthedocs (#278 )	2024-10-22 10:15:58 +05:30
original_rfc.md	chore: remove "rfc" directory and move original rfc to "docs" (#2718 )	2025-07-10 14:06:10 -07:00
quick_start.ipynb	docs: update docs to use "starter" than "ollama" (#2629 )	2025-07-05 08:44:57 +05:30
readme.md	chore: use groups when running commands (#2298 )	2025-05-28 09:13:16 -07:00

readme.md

Llama Stack Documentation

Here's a collection of comprehensive guides, examples, and resources for building AI applications with Llama Stack. For the complete documentation, visit our ReadTheDocs page.

Render locally

From the llama-stack root directory, run the following command to render the docs locally:

uv run --group docs sphinx-autobuild docs/source docs/build/html --write-all

You can open up the docs in your browser at http://localhost:8000

Content

Try out Llama Stack's capabilities through our detailed Jupyter notebooks:

Building AI Applications Notebook - A comprehensive guide to building production-ready AI applications using Llama Stack
Benchmark Evaluations Notebook - Detailed performance evaluations and benchmarking results
Zero-to-Hero Guide - Step-by-step guide for getting started with Llama Stack