Fix incorrect Tensor Size for NestedTensor QKV Transform #2450

yucai-intel · 2025-12-02T09:16:04Z

To solve #2182 : Q (Query) tensor output size from torch.transform_bias_rescale_qkv was mismatched against the expected reference size in test cases involving Nested Tensors where the sequence length (T) was not a multiple of 8 after implicit padding.

Resolution: The resolution involved introducing logic within the C++ function transform_bias_rescale_qkv_xpu specifically for the Nested Tensor case to explicitly use the calculated sequence length T to resize the output q, k, and v tensors, thereby ensuring their final size matches the shape derived by the Python reference implementation.

Copilot

Pull request overview

This PR fixes a tensor size mismatch issue in the transform_bias_rescale_qkv_xpu function when processing NestedTensors. The problem occurred when the sequence length (T) wasn't a multiple of 8 after implicit padding, causing the Q tensor output size to differ from the expected reference size.

Key Changes:

Added explicit padding logic to round up the sequence length T to the next multiple of 8 for NestedTensor cases
This ensures output tensor dimensions align with Tensor core requirements and match the Python reference implementation

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

src/ATen/native/transformers/Attention.cpp

CuiYifeng

LGTM.

liangan1 · 2025-12-09T05:29:55Z

src/ATen/native/transformers/Attention.cpp

+    // cores. Otherwise, sometimes with padding, *no* row will have the maximum
+    // sequence length and so we'll have a non-divisible-by-8 dimension even if
+    // the model author chose a multiple of 8.
+    T = T + (8 - (T % 8)) % 8;


The dpas should not have limitation, the m can be 1~8, why we need to change it here?

CuiYifeng requested a review from Copilot December 9, 2025 01:16

Copilot AI reviewed Dec 9, 2025

View reviewed changes

src/ATen/native/transformers/Attention.cpp Show resolved Hide resolved

CuiYifeng approved these changes Dec 9, 2025

View reviewed changes

Update Attention.cpp

980872b

CuiYifeng force-pushed the yucai/mha/nested/fix branch from 590ebe6 to 980872b Compare December 9, 2025 02:59

CuiYifeng changed the title ~~Fix: Incorrect Tensor Size for NestedTensor QKV Transform~~ Fix incorrect Tensor Size for NestedTensor QKV Transform Dec 9, 2025

CuiYifeng requested a review from liangan1 December 9, 2025 03:01

liangan1 reviewed Dec 9, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix incorrect Tensor Size for NestedTensor QKV Transform #2450

Fix incorrect Tensor Size for NestedTensor QKV Transform #2450

Uh oh!

yucai-intel commented Dec 2, 2025 •

edited by CuiYifeng

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

CuiYifeng left a comment

Uh oh!

liangan1 Dec 9, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Fix incorrect Tensor Size for NestedTensor QKV Transform #2450

Are you sure you want to change the base?

Fix incorrect Tensor Size for NestedTensor QKV Transform #2450

Uh oh!

Conversation

yucai-intel commented Dec 2, 2025 • edited by CuiYifeng Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

CuiYifeng left a comment

Choose a reason for hiding this comment

Uh oh!

liangan1 Dec 9, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

yucai-intel commented Dec 2, 2025 •

edited by CuiYifeng

Loading