mirror of https://github.com/meta-llama/llama-stack.git synced 2025-10-04 04:04:14 +00:00

Ashwin Bharambe 7519b73fcc

feat(distro): fork off a starter-gpu distribution (#3240 )

The starter distribution added post-training which added torch
dependencies which pulls in all the nvidia CUDA libraries. This made our
starter container very big. We have worked hard to keep the starter
container small so it serves its purpose as a starter. This PR tries to
get it back to its size by forking off duplicate "-gpu" providers for
post-training. These forked providers are then used for a new
`starter-gpu` distribution which can pull in all dependencies.

2025-08-22 15:47:15 -07:00

456 B

Raw Blame History

inline::torchtune-gpu

Description

TorchTune-based post-training provider for fine-tuning and optimizing models using Meta's TorchTune framework.

Configuration

Field	Type	Required	Default	Description
`torch_seed`	`int \| None`	No
`checkpoint_format`	`Literal['meta', 'huggingface'`	No	meta

Sample Configuration

checkpoint_format: meta

456 B Raw Blame History

inline::torchtune-gpu

Description

Configuration

Sample Configuration

456 B

Raw Blame History