llama-stack-mirror/llama_stack
Charlie Doern 6c3a40e3d2 feat: add huggingface post_training impl
adds an inline HF SFTTrainer provider. Alongside touchtune -- this is a super popular option for running training jobs. The config allows a user to specify some key fields such as a model, chat_template, device, etc

the provider comes with one recipe `finetune_single_device` which works both with and without LoRA.

any model that is a valid HF identifier can be given and the model will be pulled.

this has been tested so far with CPU and MPS device types, but should be compatible with CUDA out of the box

The provider processes the given dataset into the proper format, established the various steps per epoch, steps per save, steps per eval, sets a sane SFTConfig, and runs n_epochs of training

if checkpoint_dir is none, no model is saved. If there is a checkpoint dir, a model is saved every `save_steps` and at the end of training.

Signed-off-by: Charlie Doern <cdoern@redhat.com>
2025-05-16 16:37:30 -04:00
..
apis chore: more API validators (#2165) 2025-05-15 11:22:51 -07:00
cli feat: refactor external providers dir (#2049) 2025-05-15 20:17:03 +02:00
distribution feat: allow the interface on which the server will listen to be configured (#2015) 2025-05-16 12:59:31 -07:00
models fix: llama4 tool use prompt fix (#2103) 2025-05-06 22:18:31 -07:00
providers feat: add huggingface post_training impl 2025-05-16 16:37:30 -04:00
strong_typing chore: enable pyupgrade fixes (#1806) 2025-05-01 14:23:50 -07:00
templates feat: allow the interface on which the server will listen to be configured (#2015) 2025-05-16 12:59:31 -07:00
ui feat: Adding dark mode, cleaning the UI a small bit, adding a link to the API documentation, and linting the code. (#2182) 2025-05-16 10:48:26 -07:00
__init__.py export LibraryClient 2024-12-13 12:08:00 -08:00
env.py refactor(test): move tools, evals, datasetio, scoring and post training tests (#1401) 2025-03-04 14:53:47 -08:00
log.py chore: enable pyupgrade fixes (#1806) 2025-05-01 14:23:50 -07:00
schema_utils.py chore: enable pyupgrade fixes (#1806) 2025-05-01 14:23:50 -07:00