# Starting a Llama Stack
```{toctree}
:maxdepth: 3
:hidden:

importing_as_library
self_hosted_distro/index
remote_hosted_distro/index
building_distro
ondevice_distro/index
```
You can start a Llama Stack server using "distributions" (see Concepts) in one of the following ways:
- **Docker**: we provide a number of pre-built Docker containers, allowing you to get started instantly. If you are focused on application development, we recommend this option. You can also build your own custom Docker container. See the sketch after this list for an example invocation.
- **Conda**: the `llama` CLI provides a simple set of commands to build, configure, and run a Llama Stack server containing the exact combination of providers you wish. We have provided various templates to make getting started easier.
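As a minimal sketch of both paths: the image name `llamastack/distribution-ollama`, the `ollama` template, and port `8321` below are illustrative assumptions; substitute the distribution and port you actually choose, plus any `--env` settings your distribution requires.

```bash
# Option 1: run a pre-built Docker container.
# (llamastack/distribution-ollama and port 8321 are placeholders; most
# distributions also need provider settings passed via --env flags.)
docker run -it \
  -p 8321:8321 \
  -v ~/.llama:/root/.llama \
  llamastack/distribution-ollama \
  --port 8321

# Option 2: build from a template and run with the llama CLI inside Conda.
llama stack build --template ollama --image-type conda
llama stack run ollama --port 8321
```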
Which distribution to choose depends on the hardware you have for running LLM inference.
- Do you have access to a machine with powerful GPUs? If so, we suggest a self-hosted distribution (see [Self-Hosted Distributions](self_hosted_distro/index)).
- Are you running on a "regular" desktop machine? If so, we suggest a self-hosted distribution backed by a local inference provider such as Ollama (see [Self-Hosted Distributions](self_hosted_distro/index)).
- Do you have an API key for a remote inference provider like Fireworks, Together, etc.? If so, we suggest a remote-hosted distribution (see [Remote-Hosted Distributions](remote_hosted_distro/index)).
- Do you want to run Llama Stack inference on your iOS / Android device? If so, we suggest an on-device distribution (see [On-Device Distributions](ondevice_distro/index)):
  - iOS
  - Android (coming soon)
You can also build your own [custom distribution](building_distro).
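As a sketch of that path, `llama stack build` run without arguments walks you through assembling a custom distribution interactively; the name `my-stack` below is just an illustrative placeholder for whatever name you choose.

```bash
# Interactive build: prompts for a distribution name, image type
# (e.g. conda or container), and the providers for each API.
llama stack build

# Start the resulting server; "my-stack" is the name chosen above.
# Depending on your version, you may instead point at the generated
# run.yaml that the build step prints out.
llama stack run my-stack
```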