Skip to content

Pull requests: huggingface/trl

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Fix SFTTrainer support for single-image data
#5132 opened Feb 19, 2026 by qgallouedec Loading…
Fix SFTTrainer crash when train_dataset=None
#5131 opened Feb 19, 2026 by albertvillanova Loading…
MGPO feature addition
#5126 opened Feb 19, 2026 by damoonsh Loading…
2 of 5 tasks
feat(experimental): Divergence Proximal Policy Optimization
#5117 opened Feb 17, 2026 by LeonEricsson Loading…
5 tasks
feature: Configurable num logprobs in vLLM generation
#5107 opened Feb 16, 2026 by LeonEricsson Loading…
2 of 6 tasks
Add support for DGPO (ICLR 2026) to GRPO
#5102 opened Feb 15, 2026 by YanqiDai Loading…
5 tasks done
Add environment_factory to GRPOTrainer
#5093 opened Feb 13, 2026 by qgallouedec Loading…
Add support for DPPO [WIP]
#5065 opened Feb 10, 2026 by catherinelee274 Draft
5 tasks
Fix GRPO VLM prompt handling for string prompts
#5064 opened Feb 10, 2026 by akshan-main Loading…
5 tasks done
3
5
Add CFPO objective to GRPO trainer
#5027 opened Feb 9, 2026 by asparius Loading…
Add support for MaxRL
#5026 opened Feb 9, 2026 by catherinelee274 Loading…
4 of 5 tasks
Feature/ HICRA implementation
#4997 opened Feb 6, 2026 by w601sxs Loading…
2 of 5 tasks
Add OpenEnv's Rubrics support
#4994 opened Feb 6, 2026 by sergiopaniego Draft
5 tasks
fix: add gradient checkpointing to PolicyAndValueWrapper
#4955 opened Feb 3, 2026 by lvhungdev Loading…
3 of 5 tasks
OpenEnv clients async support update
#4949 opened Feb 2, 2026 by sergiopaniego Loading…
5 tasks
[Experimental] Add SDFT trainer, config, docs, and tests
#4941 opened Jan 31, 2026 by Shekswess Loading…
4 of 5 tasks
Update RewardFunc type to use RewardCallable protocol
#4938 opened Jan 31, 2026 by amit9oct Loading…
2 of 5 tasks
ProTip! Exclude everything labeled bug with -label:bug.