llama-stack-mirror

mirror of https://github.com/meta-llama/llama-stack.git synced 2025-12-28 07:21:59 +00:00

History

Charlie Doern 6c3a40e3d2 feat: add huggingface post_training impl adds an inline HF SFTTrainer provider. Alongside touchtune -- this is a super popular option for running training jobs. The config allows a user to specify some key fields such as a model, chat_template, device, etc the provider comes with one recipe `finetune_single_device` which works both with and without LoRA. any model that is a valid HF identifier can be given and the model will be pulled. this has been tested so far with CPU and MPS device types, but should be compatible with CUDA out of the box The provider processes the given dataset into the proper format, established the various steps per epoch, steps per save, steps per eval, sets a sane SFTConfig, and runs n_epochs of training if checkpoint_dir is none, no model is saved. If there is a checkpoint dir, a model is saved every `save_steps` and at the end of training. Signed-off-by: Charlie Doern <cdoern@redhat.com>		2025-05-16 16:37:30 -04:00
..
agents	fix: Responses API: handle type=None in streaming tool calls (#2166 )	2025-05-14 14:16:33 -07:00
datasetio	chore(refact): move paginate_records fn outside of datasetio (#2137 )	2025-05-12 10:56:14 -07:00
eval	feat: implementation for agent/session list and describe (#1606 )	2025-05-07 14:49:23 +02:00
inference	chore: enable pyupgrade fixes (#1806 )	2025-05-01 14:23:50 -07:00
ios/inference	chore: removed executorch submodule (#1265 )	2025-02-25 21:57:21 -08:00
post_training	feat: add huggingface post_training impl	2025-05-16 16:37:30 -04:00
safety	chore: enable pyupgrade fixes (#1806 )	2025-05-01 14:23:50 -07:00
scoring	chore: enable pyupgrade fixes (#1806 )	2025-05-01 14:23:50 -07:00
telemetry	feat: add metrics query API (#1394 )	2025-05-07 10:11:26 -07:00
tool_runtime	feat: Adding support for customizing chunk context in RAG insertion and querying (#2134 )	2025-05-14 21:56:20 -04:00
vector_io	feat: implementation for agent/session list and describe (#1606 )	2025-05-07 14:49:23 +02:00
__init__.py	`impls` -> `inline`, `adapters` -> `remote` (#381 )	2024-11-06 14:54:05 -08:00