generated from fastai/nbdev_template
-
Notifications
You must be signed in to change notification settings - Fork 2.2k
Pull requests: huggingface/trl
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add
config_init_kwargs
option in GRPOConfig
#4069
opened Sep 12, 2025 by
hokuyama0106
Loading…
2 of 5 tasks
🗑️ Remove deprecated
AlignPropTrainer
, DDPOTrainer
and IterativeSFTTrainer
#4068
opened Sep 12, 2025 by
qgallouedec
Loading…
5 tasks
🧹 Remove
max_batch_tokens
, num_blocks
and block_size
from generation kwargs
#4065
opened Sep 11, 2025 by
qgallouedec
Loading…
feat: Add NPU and XPU support for activation offloading
#4056
opened Sep 10, 2025 by
zilongzheng
Loading…
2 of 5 tasks
✨ Add logging for training completion and model saving in training scripts
#4048
opened Sep 9, 2025 by
qgallouedec
Loading…
[Draft] Add configurable dataset column logging to GRPOTrainer W&B tables
#4045
opened Sep 9, 2025 by
davanstrien
•
Draft
Fix #3982: Fix DPO Trainer support for Gemma 3 vision models
#4022
opened Sep 6, 2025 by
akshay-babbar
Loading…
Fix: undefined
current_gradient_accumulation_steps
#4014
opened Sep 5, 2025 by
ysjprojects
Loading…
2 of 5 tasks
Fix: ignore precompute_ref_log_probs when use_liger_loss=True
#4008
opened Sep 4, 2025 by
ginkyenglee
Loading…
5 tasks
⚖️ Align SFT and DPO for model creation and deprecate
DPOConfig.padding_value
in favour or pad_token_id
#4006
opened Sep 4, 2025 by
qgallouedec
Loading…
5 tasks
Enable saving and loading precomputed reference log probabilities in …
#3986
opened Sep 1, 2025 by
ginkyenglee
Loading…
3 tasks
[SFTTrainer]: Check for assistant mask up to max_length
#3930
opened Aug 20, 2025 by
pramodith
Loading…
3 of 5 tasks
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.