Skip to content

Conversation

@genquan9
Copy link
Contributor

@genquan9 genquan9 commented Nov 7, 2025

Important

The Update branch button must only be pressed in very rare occassions.
An outdated branch is never blocking the merge of a PR.
Please reach out to the automation team before pressing that button.

What does this PR do ?

This PR is to fix Gemma3VL model training issues.

Collection: [Note which collection this PR will affect]

Changelog

  • updated existing gemma3vl finetune script to make sure all the parameters are correct and also support energon data format
  • added import/export scripts which are specifically for gemma3vl models
  • fixed issues in gemma3vl dataloader to make sure the real data can be loaded properly

Usage

  • You can potentially add a usage example below
# torchrun --nproc_per_node=1 ./scripts/vlm/gemma3vl_finetune.py --data_type=mock

GitHub Actions CI

The Jenkins CI system has been replaced by GitHub Actions self-hosted runners.

The GitHub Actions CI will run automatically when the "Run CICD" label is added to the PR.
To re-run CI remove and add the label again.
To run CI on an untrusted fork, a NeMo user with write access must first click "Approve and run".

Before your PR is "Ready for review"

Pre checks:

  • [Y] Make sure you read and followed Contributor guidelines
  • [N] Did you write any new necessary tests?
  • [Y] Did you add or update any necessary documentation?
  • [N] Does the PR affect components that are optional to install? (Ex: Numba, Pynini, Apex etc)
    • [Y] Reviewer: Does the PR have correct import guards for all optional libraries?

PR Type:

  • [Y] Bugfix

If you haven't finished some of the above items you can still open "Draft" PR.

Who can review?

Anyone in the community is free to review the PR once the checks have passed.
Contributor guidelines contains specific people who can review PRs to various areas.

Additional Information

  • Related to # (issue)

hemildesai
hemildesai previously approved these changes Nov 14, 2025
Copy link
Collaborator

@hemildesai hemildesai left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM and the changes are all scoped to Gemma so shouldn't affect anything else.

meatybobby and others added 10 commits November 20, 2025 21:27
* Update MagpieTTS

Signed-off-by: Jason <[email protected]>

* allow None in dataset path

Signed-off-by: Jason <[email protected]>

* try to fix test by removing lhotse; fix yamls in fast dev run tests

Signed-off-by: Jason <[email protected]>

* increase zeroshot cer value; attempt to fix PO test; add back lhotse in parakeet inference to test segmentation fault

Signed-off-by: Jason <[email protected]>

* remove branch from test

Signed-off-by: Jason <[email protected]>

* use batch_size 1

Signed-off-by: Jason <[email protected]>

* update GRPO test script

Signed-off-by: Jason <[email protected]>

* add use_lhotse as a param to transcribe; attempt to fix PO test again; attempt to catch error

Signed-off-by: Jason <[email protected]>

* fix tests

Signed-off-by: Jason <[email protected]>

* update rnnt transcribe; fix po test again

Signed-off-by: Jason <[email protected]>

* Apply suggestion from @XuesongYang

Signed-off-by: Xuesong Yang <[email protected]>

Signed-off-by: Xuesong Yang <[email protected]>

* Move FCD copyright text from TorchEval to top of file

Signed-off-by: Fejgin, Roy <[email protected]>

* Remove duplicate copyright text

It is now at the top of the file.

Signed-off-by: Fejgin, Roy <[email protected]>

* Apply suggestion from @XuesongYang

Signed-off-by: Xuesong Yang <[email protected]>

Signed-off-by: Xuesong Yang <[email protected]>

* Apply suggestion from @XuesongYang

Signed-off-by: Xuesong Yang <[email protected]>

Signed-off-by: Xuesong Yang <[email protected]>

* Apply suggestion from @XuesongYang

Signed-off-by: Xuesong Yang <[email protected]>

Signed-off-by: Xuesong Yang <[email protected]>

* Apply suggestion from @XuesongYang

Signed-off-by: Xuesong Yang <[email protected]>

Signed-off-by: Xuesong Yang <[email protected]>

* Fix OnlinePO test: escape a special character in command line

Signed-off-by: Fejgin, Roy <[email protected]>

* Easier-to-read way to quote a special character in OnlinePO test

Signed-off-by: Fejgin, Roy <[email protected]>

* Work around ASR Lhotse issue

... and remove some debug code.

Signed-off-by: Fejgin, Roy <[email protected]>

* Remove FCD metric for now

Signed-off-by: Fejgin, Roy <[email protected]>

* Remove unused import

Signed-off-by: Fejgin, Roy <[email protected]>

* Update examples/tts/conf/magpietts/magpietts_lhotse.yaml

Signed-off-by: Fejgin, Roy <[email protected]>

Signed-off-by: Roy Fejgin <[email protected]>

---------

Signed-off-by: Jason <[email protected]>
Signed-off-by: Xuesong Yang <[email protected]>
Signed-off-by: Fejgin, Roy <[email protected]>
Signed-off-by: Roy Fejgin <[email protected]>
Co-authored-by: Xuesong Yang <[email protected]>
Co-authored-by: Fejgin, Roy <[email protected]>
Signed-off-by: genquan9 <[email protected]>
…IA-NeMo#15091)

* ASR Inference: load decoding params from config for RNN-T

Signed-off-by: Vladimir Bataev <[email protected]>
Signed-off-by: genquan9 <[email protected]>
Signed-off-by: nithinraok <[email protected]>
Signed-off-by: genquan9 <[email protected]>
@genquan9
Copy link
Contributor Author

genquan9 commented Nov 20, 2025

i added missing headers for the new added files: 104d821

@chtruong814 chtruong814 merged commit ac2795e into NVIDIA-NeMo:main Nov 21, 2025
486 of 507 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.