Pull requests: huggingface/trl
args.gradient_checkpointing = False instead of args = dataclasses.replace(args, gradient_checkpointing=False)
#3981
opened Aug 30, 2025 by
qgallouedec
Fix GRPO CLI by setting parameters for get_soft_overlong_punishment
#3972
opened Aug 29, 2025 by
qgallouedec
🎯 Add Trackio integration documentation and update TOC
#3971
opened Aug 28, 2025 by
qgallouedec
5 tasks
👷 Added Kernels on the Hub x TRL guide
#3969
opened Aug 28, 2025 by
sergiopaniego
2 of 8 tasks
[GRPO] Adds an option to sleep vllm when running in colocated mode
#3968
opened Aug 28, 2025 by
edbeeching
[GRPO]: Fix Multi-GPU training for Entropy based masking of tokens.
#3964
opened Aug 27, 2025 by
pramodith
2 of 5 tasks
[SFTTrainer]: Check for assistant mask up to max_length
#3930
opened Aug 20, 2025 by
pramodith
3 of 5 tasks
GRPOTrainer : fix prompt truncation for multimodal inputs with multiple image tokens
#3879
opened Aug 11, 2025 by
artem-spector
4 tasks
[GRPO] Truncated Importance Sampling to address rollout-training mismatch
#3867
opened Aug 7, 2025 by
LeonEricsson
Draft
5 tasks
[#3647] Fix: Assign default values in the GKDTrainer's constructor only when …
#3851
opened Aug 5, 2025 by
seungduk-yanolja
2 of 5 tasks
Update profiling.py: fix scoping problems for wandb and mlflow
#3845
opened Aug 4, 2025 by
markshinyounglee
5 tasks done
Add vLLM server mode and VLM support to OnlineDPOTrainer
#3783
opened Jul 27, 2025 by
vaelev
6 tasks done
change doc for num_iterations and steps_per_generation to hopefully make them more clear and differentiate between them more clearly
#3761
opened Jul 23, 2025 by
avishaiElmakies
2 of 5 tasks
Dynamic sampling option in GRPO trainer based on DAPO paper
#3758
opened Jul 23, 2025 by
almeidava93
2 of 5 tasks