This commit is contained in:
Nehanth Narendrula 2025-07-24 20:55:49 -07:00 committed by GitHub
commit afe58dd244
No known key found for this signature in database
GPG key ID: B5690EEEBB952194
7 changed files with 930 additions and 207 deletions

View file

@ -24,6 +24,9 @@ HuggingFace-based post-training provider for fine-tuning models using the Huggin
| `weight_decay` | `<class 'float'>` | No | 0.01 | |
| `dataloader_num_workers` | `<class 'int'>` | No | 4 | |
| `dataloader_pin_memory` | `<class 'bool'>` | No | True | |
| `dpo_beta` | `<class 'float'>` | No | 0.1 | |
| `use_reference_model` | `<class 'bool'>` | No | True | |
| `dpo_loss_type` | `Literal['sigmoid', 'hinge', 'ipo', 'kto_pair'` | No | sigmoid | |
## Sample Configuration