Pull requests: huggingface/trl

Pull requests list

docs: add ZeRO paper (1910.02054) to paper_index
#4551 opened Nov 20, 2025 by JenWei0312
3 of 5 tasks
Fix bug with GRPO trainer when tokenizer outputs token_type_ids
#4549 opened Nov 20, 2025 by d0rbu
1 of 5 tasks
Add PSPO trust region method as alternative to clipping in GRPOTrainer
#4548 opened Nov 19, 2025 by MCDwyer
2 of 5 tasks
fix: add vllm_group_port
#4545 opened Nov 19, 2025 by pointerhacker
3 of 5 tasks
Add GRPO Wordle OpenEnv Colab
#4542 opened Nov 18, 2025 by sergiopaniego
5 tasks
Add target_parameters to LoraConfig
#4536 opened Nov 18, 2025 by jonnyli1125
5 tasks
Add compute_metrics parameter for GRPOTrainer
#4534 opened Nov 17, 2025 by colinzhaoxp
[GRPO] Sequence-level TIS & MIS
#4530 opened Nov 16, 2025 by LeonEricsson
5 tasks
Add Qwen3VLGRPOTrainer for Qwen3-VL GRPO training
#4529 opened Nov 16, 2025 by NDNM1408
Make skip_special_tokens configurable
#4521 opened Nov 13, 2025 by taha-yassine
3 of 5 tasks
fix tokenize bug for ppo_tldr example
#4520 opened Nov 13, 2025 by kaixuanliu
[GRPO] switch grpo liger loss to triton version
#4519 opened Nov 13, 2025 by kashif Draft
5 tasks
adding [SimPER](https://arxiv.org/abs/2502.00883)
#4486 opened Nov 6, 2025 by leeparkuky
2 of 5 tasks
Add attention_mask to signature_columns
#4459 opened Nov 5, 2025 by shubhamjain0594
5 tasks
added 10 papers (+trainer cross-links) for #4407
#4441 opened Nov 3, 2025 by SSusantAchary
4 tasks done