Skip to content

Conversation

@copybara-service
Copy link
Contributor

Use non-temporal stores where possible in weight packing kernels.

I've also added an assertion for the correct packed weights pointer alignment. This is not a new constraint since the replaced xnn_store_s32 and _mm(256|512)?_store_.* instructions already had the same alignment requirements.

I've also added an assertion for the correct packed weights pointer alignment. This is _not_ a new constraint since the replaced `xnn_store_s32` and `_mm(256|512)?_store_.*` instructions already had the same alignment requirements.

PiperOrigin-RevId: 805224577
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants