# Distributions

While there is a lot of flexibility to mix-and-match providers, users often work with a specific set of providers (for reasons of hardware support, contractual obligations, etc.). We therefore need a convenient shorthand for such collections. We call this shorthand a **Llama Stack Distribution** or a **Distro**. You can think of a Distro as a specific, pre-packaged version of the Llama Stack. Here are some examples:

**Remotely Hosted Distro**: These are the simplest to consume from a user perspective. You simply obtain an API key from the provider, point your client at its URL, and have all the Llama Stack APIs working out of the box. Currently, Fireworks and Together provide such easy-to-consume Llama Stack distributions.
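As a rough sketch of what "point to a URL" means in practice: the endpoint and API key below are placeholders you obtain from the hosting provider, and the `/v1/models` route is an assumption about the deployed Llama Stack REST API, so verify both against your provider's documentation.

```shell
# Placeholder values -- substitute the endpoint and key your provider gives you.
export LLAMA_STACK_BASE_URL="https://<provider-endpoint>"
export LLAMA_STACK_API_KEY="<your-api-key>"

# List the models the hosted distribution exposes (route assumed from the
# Llama Stack REST API; check your provider's docs for the exact paths).
curl -H "Authorization: Bearer $LLAMA_STACK_API_KEY" \
  "$LLAMA_STACK_BASE_URL/v1/models"
```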

**Locally Hosted Distro**: You may want to run Llama Stack on your own hardware. Even then, you typically still consume Inference via an external service; providers like Hugging Face TGI, Fireworks, or Together work well for this purpose. Alternatively, if you have access to GPUs, you can run a vLLM or NVIDIA NIM instance, and if you "just" have a regular desktop machine, you can use Ollama for inference. To provide convenient quick access to these options, we provide a number of pre-configured locally-hosted Distros.
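One plausible flow for standing up a locally hosted Distro backed by Ollama is sketched below; the template name and build flags are illustrative, so check the distributions documentation for the exact names available in your Llama Stack version.

```shell
# Start the local inference engine (Ollama) in the background.
ollama serve &

# Install the Llama Stack CLI.
pip install llama-stack

# Build and run a pre-configured Distro. "ollama" is an assumed template
# name; list available templates / check the docs for your version.
llama stack build --template ollama --image-type venv
llama stack run ollama   # serves the Llama Stack APIs locally
```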

**On-device Distro**: To run Llama Stack directly on an edge device (a mobile phone or tablet), we provide Distros for iOS and Android.