llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-28 06:41:59 +00:00

History

Charlie Doern d6228bb90e fix: proper checkpointing logic for HF trainer currently only the last saved model is reported as a checkpoint and associated with the job UUID. since the HF trainer handles checkpoint collection during training, we need to add all of the `checkpoint-*` folders as Checkpoint objects. Adjust the save strategy to be per-epoch to make this easier and to use less storage Signed-off-by: Charlie Doern <cdoern@redhat.com>		2025-06-25 20:01:36 -04:00
..
recipes	fix: proper checkpointing logic for HF trainer	2025-06-25 20:01:36 -04:00
__init__.py	feat: add huggingface post_training impl (#2132 )	2025-05-16 14:41:28 -07:00
config.py	feat: add huggingface post_training impl (#2132 )	2025-05-16 14:41:28 -07:00
post_training.py	feat: add huggingface post_training impl (#2132 )	2025-05-16 14:41:28 -07:00