mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-06 02:30:58 +00:00

History

Charlie Doern de6919ecdd refactor: install external providers from module (#2637 ) # What does this PR do? Today, external providers are installed via the `external_providers_dir` in the config. This necessitates users to understand the `ProviderSpec` and set up their directories accordingly. This process splits up the config for the stack across multiple files, directories, and formats. Most (if not all) external providers today have a [get_provider_spec](`559cb18fbb/src/ramalama_stack/provider.py (L9)`) method that sits unused. Utilizing this method rather than the providers.d route allows for a much easier installation process for external providers and limits the amount of extra configuration a regular user has to do to get their stack off the ground. To accomplish this and wire it throughout the build process, Introduce the concept of a `module` for users to specify for an external provider upon build time. In order to facilitate this, align the build and run spec to use `Provider` class rather than the stringified provider_type that build currently uses. For example, say this is in your build config: ``` - provider_id: ramalama provider_type: remote::ramalama module: ramalama_stack ``` during build (in the various `build_...` scripts), additionally to installing any pip dependencies we will also install this module and use the `get_provider_spec` method to retrieve the ProviderSpec that is currently specified using `providers.d`. In production so far, providing instructions for installing external providers for users has been difficult: they need to install the module as a pre-req, create the providers.d directory, copy in the provider spec, and also copy in the necessary build/run yaml files. Accessing an external provider should be as easy as possible, and pointing to its installable module aligns more with the rest of our build and dependency management process. For now, `external_providers_dir` still exists as an alternate more declarative method of using external providers. ## Test Plan added an integration test installing an external provider from module and more unit test coverage for `get_provider_registry` ( the warning in yellow is expected, the module is installed inside of the build env, not where we are running the command) <img width="1119" height="400" alt="Screenshot 2025-07-24 at 11 30 48 AM" src="https://github.com/user-attachments/assets/1efbaf45-b9e8-451a-bd63-264ed664706d" /> <img width="1154" height="618" alt="Screenshot 2025-07-24 at 11 31 14 AM" src="https://github.com/user-attachments/assets/feb2b3ea-c5dd-418e-9662-9a3bd5dd6bdc" /> --------- Signed-off-by: Charlie Doern <cdoern@redhat.com>		2025-07-25 15:41:26 +02:00
..
_static	chore: Making name optional in openai_create_vector_store (#2858 )	2025-07-22 13:31:31 -04:00
notebooks	feat: Add Nvidia e2e beginner notebook and tool calling notebook (#1964 )	2025-06-16 11:29:01 -04:00
openapi_generator	feat: Add webmethod for deleting openai responses (#2160 )	2025-06-30 11:28:02 +02:00
resources	Several documentation fixes and fix link to API reference	2025-02-04 14:00:43 -08:00
source	refactor: install external providers from module (#2637 )	2025-07-25 15:41:26 +02:00
zero_to_hero_guide	feat: consolidate most distros into "starter" (#2516 )	2025-07-04 15:58:03 +02:00
conftest.py	fix: sleep after notebook test	2025-03-23 14:03:35 -07:00
contbuild.sh	Fix broken links with docs	2024-11-22 20:42:17 -08:00
dog.jpg	Support for Llama3.2 models and Swift SDK (#98 )	2024-09-25 10:29:58 -07:00
getting_started.ipynb	docs: Add quick_start.ipynb notebook equivalent of index.md Quickstart guide (#2128 )	2025-07-03 13:55:43 +02:00
getting_started_llama4.ipynb	docs: update docs to use "starter" than "ollama" (#2629 )	2025-07-05 08:44:57 +05:30
getting_started_llama_api.ipynb	docs: Add quick_start.ipynb notebook equivalent of index.md Quickstart guide (#2128 )	2025-07-03 13:55:43 +02:00
license_header.txt	Initial commit	2024-07-23 08:32:33 -07:00
make.bat	feat(pre-commit): enhance pre-commit hooks with additional checks (#2014 )	2025-04-30 11:35:49 -07:00
Makefile	first version of readthedocs (#278 )	2024-10-22 10:15:58 +05:30
original_rfc.md	chore: remove "rfc" directory and move original rfc to "docs" (#2718 )	2025-07-10 14:06:10 -07:00
quick_start.ipynb	docs: update docs to use "starter" than "ollama" (#2629 )	2025-07-05 08:44:57 +05:30
readme.md	chore: use groups when running commands (#2298 )	2025-05-28 09:13:16 -07:00

readme.md

Llama Stack Documentation

Here's a collection of comprehensive guides, examples, and resources for building AI applications with Llama Stack. For the complete documentation, visit our ReadTheDocs page.

Render locally

From the llama-stack root directory, run the following command to render the docs locally:

uv run --group docs sphinx-autobuild docs/source docs/build/html --write-all

You can open up the docs in your browser at http://localhost:8000

Content

Try out Llama Stack's capabilities through our detailed Jupyter notebooks:

Building AI Applications Notebook - A comprehensive guide to building production-ready AI applications using Llama Stack
Benchmark Evaluations Notebook - Detailed performance evaluations and benchmarking results
Zero-to-Hero Guide - Step-by-step guide for getting started with Llama Stack