mirror of https://github.com/meta-llama/llama-stack.git synced 2025-10-04 04:04:14 +00:00

skamenan7 17fbd21c0d feat(vector-io): implement global default embedding model configuration (Issue #2729)
- Add VectorStoreConfig with global default_embedding_model and default_embedding_dimension
- Support environment variables LLAMA_STACK_DEFAULT_EMBEDDING_MODEL and LLAMA_STACK_DEFAULT_EMBEDDING_DIMENSION
- Implement precedence: explicit model > global default > clear error (no fallback)
- Update VectorIORouter with _resolve_embedding_model() precedence logic
- Remove non-deterministic 'first model in run.yaml' fallback behavior
- Add vector_store_config to StackRunConfig and all distribution templates
- Include comprehensive unit tests for config loading and router precedence
- Update documentation with configuration examples and usage patterns
- Fix error messages to include 'Failed to' prefix per coding standards
Resolves deterministic vector store creation by eliminating unpredictable fallbacks and providing clear configuration options at the stack level.
2025-09-18 10:11:44 -04:00
_static	feat: create HTTP DELETE API endpoints to unregister ScoringFn and Benchmark resources in Llama Stack (#3371)	2025-09-15 12:43:38 -07:00
notebooks	fix: Set provider_id in NVIDIA notebook when registering dataset (#3472)	2025-09-17 11:45:15 -07:00
openapi_generator	chore(rename): move llama_stack.distribution to llama_stack.core (#2975)	2025-07-30 23:30:53 -07:00
resources	Several documentation fixes and fix link to API reference	2025-02-04 14:00:43 -08:00
source	feat(vector-io): implement global default embedding model configuration (Issue #2729)	2025-09-18 10:11:44 -04:00
zero_to_hero_guide	docs: update documentation links (#3459)	2025-09-17 10:37:35 -07:00
conftest.py	fix: sleep after notebook test	2025-03-23 14:03:35 -07:00
contbuild.sh	Fix broken links with docs	2024-11-22 20:42:17 -08:00
dog.jpg	Support for Llama3.2 models and Swift SDK (#98)	2024-09-25 10:29:58 -07:00
getting_started.ipynb	docs: update documentation links (#3459)	2025-09-17 10:37:35 -07:00
getting_started_llama4.ipynb	docs: update documentation links (#3459)	2025-09-17 10:37:35 -07:00
getting_started_llama_api.ipynb	docs: update documentation links (#3459)	2025-09-17 10:37:35 -07:00
license_header.txt	Initial commit	2024-07-23 08:32:33 -07:00
make.bat	feat(pre-commit): enhance pre-commit hooks with additional checks (#2014)	2025-04-30 11:35:49 -07:00
Makefile	first version of readthedocs (#278)	2024-10-22 10:15:58 +05:30
original_rfc.md	chore(rename): move llama_stack.distribution to llama_stack.core (#2975)	2025-07-30 23:30:53 -07:00
quick_start.ipynb	docs: update documentation links (#3459)	2025-09-17 10:37:35 -07:00
README.md	docs: update documentation links (#3459)	2025-09-17 10:37:35 -07:00

README.md

Llama Stack Documentation

Here's a collection of comprehensive guides, examples, and resources for building AI applications with Llama Stack. For the complete documentation, visit our GitHub page.

Render locally

From the llama-stack root directory, run the following command to render the docs locally:

uv run --group docs sphinx-autobuild docs/source docs/build/html --write-all

Once the server is running, open the docs in your browser at http://localhost:8000
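
If port 8000 is already taken on your machine, the render command can be pointed at another port. The following is a minimal sketch, not part of the repository: `docs_cmd` and `DOCS_PORT` are hypothetical names introduced here for illustration, and the sketch assumes the installed `sphinx-autobuild` accepts its usual `--port` option. The helper only prints the command so that it can be inspected before anything is run.

```shell
#!/bin/sh
# DOCS_PORT is a hypothetical variable for this example; it defaults to 8000,
# the port mentioned above.
DOCS_PORT="${DOCS_PORT:-8000}"

# docs_cmd (hypothetical helper): print the render command for a given port
# instead of executing it.
docs_cmd() {
  echo "uv run --group docs sphinx-autobuild docs/source docs/build/html --write-all --port $1"
}

# Show the command that would be run for the chosen port.
docs_cmd "$DOCS_PORT"
```

To start the server on port 8080, for example, run the printed command from the llama-stack root directory (or `eval "$(docs_cmd 8080)"`), then browse to http://localhost:8080 instead.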

Content

Try out Llama Stack's capabilities through our detailed Jupyter notebooks:

Building AI Applications Notebook - A comprehensive guide to building production-ready AI applications using Llama Stack
Benchmark Evaluations Notebook - Detailed performance evaluations and benchmarking results
Zero-to-Hero Guide - Step-by-step guide for getting started with Llama Stack