
Conversation

@d0rbu commented Nov 20, 2025

When training with a tokenizer that has return_token_type_ids=True as its default, we get an error from _validate_model_kwargs in the transformers library:

The following `model_kwargs` are not used by the model: ['token_type_ids'] (note: typos in the generate arguments will also show up in this list)

This PR sets return_token_type_ids to False in the GRPO trainer to prevent the error. PleIAs/Baguettotron and PleIAs/Monad are examples of models with which I ran into this issue.
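For context, here is a minimal sketch of the failure mode. The dict below is an illustrative stand-in for a real tokenizer's output (the token values are made up; only the keys matter), not TRL internals: with return_token_type_ids enabled, the encoding carries an extra token_type_ids key, and forwarding the whole encoding to generate() trips _validate_model_kwargs on models that don't accept it. Setting return_token_type_ids=False (or filtering the key out) keeps it out of model_kwargs.

```python
# Illustrative stand-in for a tokenizer call with return_token_type_ids=True
# (made-up token IDs; only the dict keys matter for this issue).
encoding = {
    "input_ids": [[101, 2023, 2003, 102]],
    "attention_mask": [[1, 1, 1, 1]],
    "token_type_ids": [[0, 0, 0, 0]],  # the key _validate_model_kwargs rejects
}

# Forwarding the encoding unchanged would surface token_type_ids as an unused
# model kwarg; dropping it (as the PR does via return_token_type_ids=False)
# avoids the error.
model_kwargs = {k: v for k, v in encoding.items() if k != "token_type_ids"}
print(sorted(model_kwargs))  # → ['attention_mask', 'input_ids']
```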

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline,
    Pull Request section?
  • Was this discussed/approved via a GitHub issue? Please add a link
    to it if that's the case.
  • Did you make sure to update the documentation with your changes?
  • Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@qgallouedec (Member)

I'm not sure about this one. The issue probably comes from the fact that the tokenizer configuration is wrong. A patch for this could be

trainer = GRPOTrainer(
    model="PleIAs/Monad",
    ...
)
# model_input_names is a list of strings, so use remove() rather than pop()
trainer.processing_class.model_input_names.remove("token_type_ids")
trainer.train()

@d0rbu (Author) commented Nov 21, 2025

Got it, thanks for the feedback!
