phoenix-oss/llama-stack-mirror: Composable building blocks to build Llama Apps

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-03 09:53:45 +00:00

Composable building blocks to build Llama Apps https://llama-stack.readthedocs.io

Hardik Shah b0f3406a08 deleting bash script as this is not done via cli		2024-07-21 12:55:49 -07:00
toolchain	enable import of subcommands from llama-agentic-system	2024-07-21 12:54:48 -07:00
.gitignore	more work on agent definitions	2024-07-09 13:53:09 -07:00
fp8_requirements.txt	cleanup for fp8 and requirements etc	2024-07-20 23:21:55 -07:00
README.md	update README a bit	2024-07-20 23:26:50 -07:00
requirements.txt	update requirements for running standalone	2024-07-19 18:11:25 -07:00

This repo contains the API specifications for various parts of the Llama Stack. The Stack consists of toolchain-apis and agentic-apis.

The toolchain APIs that are covered --

Running FP8

You need the fbgemm-gpu package, which requires torch >= 2.4.0 (currently available only in nightly builds, but releasing shortly...).

ENV=fp8_env
conda create -n $ENV python=3.10
conda activate $ENV

pip3 install -r fp8_requirements.txt
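After the install, the torch >= 2.4.0 requirement mentioned above can be sanity-checked from the shell. The following is a minimal sketch, not part of this repo: `version_at_least` is a hypothetical helper that compares only the major and minor components, which is assumed to be enough for nightly version strings such as `2.4.0.dev20240720+cu121`.

```shell
# Hypothetical helper (not from this repo): succeed if version $1 is at
# least version $2, comparing the major and minor components only.
version_at_least() {
    have_major=${1%%.*}
    have_rest=${1#*.}
    have_minor=${have_rest%%.*}
    want_major=${2%%.*}
    want_rest=${2#*.}
    want_minor=${want_rest%%.*}
    [ "$have_major" -gt "$want_major" ] ||
        { [ "$have_major" -eq "$want_major" ] && [ "$have_minor" -ge "$want_minor" ]; }
}

# Ask the active Python for its torch version; stays empty if torch
# (or python3) is missing from the current environment.
torch_version=$(python3 -c 'import torch; print(torch.__version__)' 2>/dev/null || true)

if [ -z "$torch_version" ]; then
    echo "torch is not installed in this environment"
elif version_at_least "$torch_version" 2.4.0; then
    echo "torch $torch_version satisfies >= 2.4.0"
else
    echo "torch $torch_version is older than 2.4.0"
fi
```

Run it inside the activated conda environment so that `python3` resolves to the interpreter that received the packages from fp8_requirements.txt.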

Set up virtual environment

python3 -m venv ~/.venv/toolchain/
source ~/.venv/toolchain/bin/activate

with-proxy pip3 install -r requirements.txt
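To confirm that packages are going into the toolchain environment and not the system Python, the check below may help. It is a sketch under assumptions: `venv_is_active` is a hypothetical helper, and it relies on the `VIRTUAL_ENV` variable that the venv `activate` script exports. If the path involves symlinks, the string comparison may need adjusting.

```shell
# Hypothetical helper (not from this repo): succeed only if the active
# virtual environment is the directory given as $1. A trailing slash on
# either side is ignored.
venv_is_active() {
    expected=${1%/}
    [ -n "${VIRTUAL_ENV:-}" ] && [ "${VIRTUAL_ENV%/}" = "$expected" ]
}

if venv_is_active "$HOME/.venv/toolchain"; then
    echo "toolchain venv is active: $VIRTUAL_ENV"
else
    echo "toolchain venv is not active; run: source ~/.venv/toolchain/bin/activate"
fi
```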

Run the generate.sh script

cd source && sh generate.sh