Skip to content

Pull requests: huggingface/trl

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Context Parallelism benchmark guide
#4075 opened Sep 12, 2025 by sergiopaniego Loading…
5 tasks
Add config_init_kwargs option in GRPOConfig
#4069 opened Sep 12, 2025 by hokuyama0106 Loading…
2 of 5 tasks
Add VLM support to RLOO trainer
#4067 opened Sep 11, 2025 by behroozazarkhalili Loading…
feat: Add NPU and XPU support for activation offloading
#4056 opened Sep 10, 2025 by zilongzheng Loading…
2 of 5 tasks
Enable XPU for vllm client
#4031 opened Sep 8, 2025 by jiqing-feng Loading…
vllm sleep mode support
#4028 opened Sep 8, 2025 by ved1beta Loading…
2 of 5 tasks
Fix: undefined current_gradient_accumulation_steps
#4014 opened Sep 5, 2025 by ysjprojects Loading…
2 of 5 tasks
Improve typing of SFT trainer
#4007 opened Sep 4, 2025 by cyyever Loading…
[GFPO]: implement GFPO in GRPOTrainer
#3989 opened Sep 1, 2025 by Peter-Chou Loading…
3 of 5 tasks
fix bug when using dataset streaming by accelerate
#3950 opened Aug 25, 2025 by kaixuanliu Loading…
🐳 Docker update
#3931 opened Aug 20, 2025 by qgallouedec Loading…
[SFTTrainer]: Check for assistant mask up to max_length
#3930 opened Aug 20, 2025 by pramodith Loading…
3 of 5 tasks
[DRAFT] Refactor DPO
#3906 opened Aug 15, 2025 by qgallouedec Draft
5 tasks
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.