Add limited RTX PRO 6000 coverage. #6841

alliepiper · 2025-12-02T17:03:56Z

The cuda-python team offered to let us use their hw while we wait for our much larger order to arrive. Adding minimal CUB coverage to ensure that our blackwell implementations don't regress.

copy-pr-bot · 2025-12-02T17:03:59Z

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

alliepiper · 2025-12-02T17:04:11Z

/ok to test

bernhardmgruber · 2025-12-02T18:25:08Z

I see a CI test failure in:

91/123 Test #336: cub.test.device.segmented_scan_api.lid_0 ......................***Failed    0.41 sec
  -- >> Running:
  	/home/coder/cccl/build/cuda13.0-gcc14/cub/bin/cub.test.device.segmented_scan_api.lid_0
  /home/coder/cccl/lib/cmake/libcudacxx/../../../libcudacxx/include/cuda/__cmath/fast_modulo_division.h:112: operator/: block: [0,0,0], thread: [0,0,0] Assertion `dividend must be non-negative` failed.

Funny, it's an API test. The test was added recently in #6022.

Also, the test runtime here seems a bit excessive. Is this normal?

          Start 455: cub.test.device.radix_sort_decomposer_fail.lid_0
  119/123 Test #455: cub.test.device.radix_sort_decomposer_fail.lid_0 ..............   Passed  1258.18 sec

alliepiper · 2025-12-02T18:37:39Z

@oleksandr-pavlyk can you take a look at the segmented scan issue? This is failing on RTX PRO 6000 (sm120).

Also, the test runtime here seems a bit excessive. Is this normal? cub.test.device.radix_sort_decomposer_fail.lid_0 1258.18 sec

Those _fail tests invoke compilers internally and check that compilation fails with a specific error. That does seem excessive, even for the less-powerful CPUs on the GPU runners. Looking at results on other runners from the nightlies, it takes about half as long on RTXA6000 + H100

H100: https://github.com/NVIDIA/cccl/actions/runs/19846009437/job/56871704747
A6000: https://github.com/NVIDIA/cccl/actions/runs/19846009437/job/56871704740

It's a very simple TU, might be worth seeing if we can trigger the failure sooner.

bernhardmgruber · 2025-12-02T18:52:36Z

I opened #6845 for the long test time.

oleksandr-pavlyk · 2025-12-03T19:02:21Z

I opened #6868 to fix assertions failing the segmented scan API test.

alliepiper · 2025-12-04T15:37:04Z

/ok to test

The cuda-python team offered to let us use their hw while we wait for our much larger order to arrive. Adding minimal CUB coverage to ensure that our blackwell implementations don't regress.

github-actions · 2025-12-05T07:49:08Z

🥳 CI Workflow Results

🟩 Finished in 12h 45m: Pass: 100%/270 | Total: 3d 18h | Max: 4h 15m | Hits: 97%/374695

See results here.

bernhardmgruber · 2025-12-05T07:58:10Z

Thank you @alliepiper and @kkraus14 for making that happen!

github-project-automation bot added this to CCCL Dec 2, 2025

github-project-automation bot moved this to Todo in CCCL Dec 2, 2025

cccl-authenticator-app bot moved this from Todo to In Progress in CCCL Dec 2, 2025

bernhardmgruber approved these changes Dec 2, 2025

View reviewed changes

github-project-automation bot moved this from In Progress to In Review in CCCL Dec 2, 2025

davebayer approved these changes Dec 2, 2025

View reviewed changes

This comment has been minimized.

Sign in to view

Add limited RTX PRO 6000 coverage.

050b7f9

The cuda-python team offered to let us use their hw while we wait for our much larger order to arrive. Adding minimal CUB coverage to ensure that our blackwell implementations don't regress.

alliepiper force-pushed the rtxpro6000 branch from 50dbc90 to 050b7f9 Compare December 4, 2025 19:00

alliepiper marked this pull request as ready for review December 4, 2025 19:01

alliepiper requested a review from a team as a code owner December 4, 2025 19:01

alliepiper requested a review from jrhemstad December 4, 2025 19:01

alliepiper enabled auto-merge (squash) December 4, 2025 19:01

This comment has been minimized.

Sign in to view

alliepiper merged commit afbd94d into NVIDIA:main Dec 5, 2025
637 of 642 checks passed

github-project-automation bot moved this from In Review to Done in CCCL Dec 5, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add limited RTX PRO 6000 coverage. #6841

Add limited RTX PRO 6000 coverage. #6841

alliepiper commented Dec 2, 2025

Uh oh!

copy-pr-bot bot commented Dec 2, 2025

Uh oh!

alliepiper commented Dec 2, 2025

Uh oh!

This comment has been minimized.

bernhardmgruber commented Dec 2, 2025 •

edited

Loading

Uh oh!

alliepiper commented Dec 2, 2025 •

edited

Loading

Uh oh!

bernhardmgruber commented Dec 2, 2025

Uh oh!

oleksandr-pavlyk commented Dec 3, 2025

Uh oh!

alliepiper commented Dec 4, 2025

Uh oh!

This comment has been minimized.

This comment has been minimized.

github-actions bot commented Dec 5, 2025

Uh oh!

Uh oh!

bernhardmgruber commented Dec 5, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Add limited RTX PRO 6000 coverage. #6841

Add limited RTX PRO 6000 coverage. #6841

Conversation

alliepiper commented Dec 2, 2025

Uh oh!

copy-pr-bot bot commented Dec 2, 2025

Uh oh!

alliepiper commented Dec 2, 2025

Uh oh!

This comment has been minimized.

bernhardmgruber commented Dec 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

alliepiper commented Dec 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

bernhardmgruber commented Dec 2, 2025

Uh oh!

oleksandr-pavlyk commented Dec 3, 2025

Uh oh!

alliepiper commented Dec 4, 2025

Uh oh!

This comment has been minimized.

This comment has been minimized.

github-actions bot commented Dec 5, 2025

🥳 CI Workflow Results

🟩 Finished in 12h 45m: Pass: 100%/270 | Total: 3d 18h | Max: 4h 15m | Hits: 97%/374695

Uh oh!

Uh oh!

bernhardmgruber commented Dec 5, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

bernhardmgruber commented Dec 2, 2025 •

edited

Loading

alliepiper commented Dec 2, 2025 •

edited

Loading