mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-03 09:53:45 +00:00


Sébastien Han 7ee0ee7843 chore!: remove model mgmt from CLI for Hugging Face CLI (#3700)	2025-10-09 16:50:33 -07:00

This change removes the `llama model` and `llama download` subcommands from the CLI, replacing them with recommendations to use the Hugging Face CLI instead.

Rationale for this change:
- The model management functionality was largely duplicating what Hugging Face CLI already provides, leading to unnecessary maintenance overhead (except the download source from Meta?)
- Maintaining our own implementation required fixing bugs and keeping up with changes in model repositories and download mechanisms
- The Hugging Face CLI is more mature, widely adopted, and better maintained
- This allows us to focus on the core Llama Stack functionality rather than reimplementing model management tools

Changes made:
- Removed all model-related CLI commands and their implementations
- Updated documentation to recommend using `huggingface-cli` for model downloads
- Removed Meta-specific download logic and statements
- Simplified the CLI to focus solely on stack management operations

Users should now use:
- `huggingface-cli download` for downloading models
- `huggingface-cli scan-cache` for listing downloaded models

This is a breaking change as it removes previously available CLI commands.

Signed-off-by: Sébastien Han <seb@redhat.com>
docs	chore!: remove model mgmt from CLI for Hugging Face CLI (#3700)	2025-10-09 16:50:33 -07:00
notebooks	chore: unpublish /inference/chat-completion (#3609)	2025-09-30 11:00:42 -07:00
openapi_generator	feat(api): add extra_body parameter support with shields example (#3670)	2025-10-03 13:25:09 -07:00
src	docs: api separation (#3630)	2025-10-01 10:13:31 -07:00
static	docs: API docstrings cleanup for better documentation rendering (#3661)	2025-10-06 10:46:33 -07:00
supplementary	docs: adding supplementary markdown content to API specs (#3632)	2025-10-01 10:15:30 -07:00
zero_to_hero_guide	chore!: remove --env from `llama stack run` (#3711)	2025-10-07 20:58:15 -07:00
docusaurus.config.ts	docs: add favicon and mobile styling (#3650)	2025-10-02 10:42:54 +02:00
dog.jpg	Support for Llama3.2 models and Swift SDK (#98)	2024-09-25 10:29:58 -07:00
getting_started.ipynb	chore!: remove --env from `llama stack run` (#3711)	2025-10-07 20:58:15 -07:00
getting_started_llama4.ipynb	chore!: remove model mgmt from CLI for Hugging Face CLI (#3700)	2025-10-09 16:50:33 -07:00
getting_started_llama_api.ipynb	chore!: remove --env from `llama stack run` (#3711)	2025-10-07 20:58:15 -07:00
license_header.txt	Initial commit	2024-07-23 08:32:33 -07:00
original_rfc.md	chore(rename): move llama_stack.distribution to llama_stack.core (#2975)	2025-07-30 23:30:53 -07:00
package-lock.json	docs: docusaurus setup (#3541)	2025-09-24 14:11:30 -07:00
package.json	docs: docusaurus setup (#3541)	2025-09-24 14:11:30 -07:00
quick_start.ipynb	chore!: remove --env from `llama stack run` (#3711)	2025-10-07 20:58:15 -07:00
README.md	docs: docusaurus setup (#3541)	2025-09-24 14:11:30 -07:00
sidebars.ts	docs: Update docs navbar config (#3653)	2025-10-02 16:48:38 +02:00
tsconfig.json	docs: docusaurus setup (#3541)	2025-09-24 14:11:30 -07:00

README.md

Llama Stack Documentation

Here's a collection of guides, examples, and resources for building AI applications with Llama Stack. For the complete documentation, visit our GitHub page.

Render locally

From the llama-stack docs/ directory, run the following commands to render the docs locally:

npm install
npm run gen-api-docs all
npm run build
npm run serve

Once the last command is serving the site, open the docs in your browser at http://localhost:3000
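
If you render the docs often, the four commands above can be wrapped in a small shell function that stops at the first failing step. This is only a sketch: `render_docs` and the `DRY_RUN` variable are hypothetical names introduced here for illustration and are not part of the repository. It assumes you are already in the docs/ directory.

```shell
# Sketch: run the documented render steps in order and stop on the first failure.
# render_docs and DRY_RUN are hypothetical helpers, not part of llama-stack.
render_docs() {
  for step in \
    "npm install" \
    "npm run gen-api-docs all" \
    "npm run build" \
    "npm run serve"
  do
    # Print each step before running it so a failure is easy to locate.
    echo "+ $step"
    if [ "${DRY_RUN:-0}" != "1" ]; then
      # Unquoted on purpose: each step is a plain command line to be word-split.
      $step || return 1
    fi
  done
}
```

Calling `DRY_RUN=1 render_docs` only prints the steps without running them; calling `render_docs` runs them, and the final `npm run serve` step keeps running until you stop it.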

Content

Try out Llama Stack's capabilities through our detailed Jupyter notebooks:

Building AI Applications Notebook - A comprehensive guide to building production-ready AI applications using Llama Stack
Benchmark Evaluations Notebook - Detailed performance evaluations and benchmarking results
Zero-to-Hero Guide - Step-by-step guide for getting started with Llama Stack