Commit 59a4988
Fix incorrect reduction for small kernels
Ensures num_items is recalculated after updating group_width when num_items is less than max_sg_sz, preventing incorrect parallelism configuration.

1 parent b44682c commit 59a4988
1 file changed: 3 additions, 0 deletions

| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 259 | 259 | |
| 260 | 260 | |
| 261 | 261 | |
| | 262 | + |
| 262 | 263 | |
| | 264 | + |
| | 265 | + |
| 263 | 266 | |
| 264 | 267 | |
| 265 | 268 | |
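
The commit message describes recomputing num_items once group_width has been changed for small kernels. The changed lines themselves are not preserved on this page, so the following is only a minimal sketch of that idea and not the code from the commit. The names group_width, num_items and max_sg_sz are taken from the commit message; the struct `ReduceConfigSketch`, the field `group_height`, the function `set_group_dimension`, and the policy of widening the group to one full sub-group are all assumptions made for illustration.

```cpp
#include <cassert>

// Hypothetical stand-in for a reduction launch configuration.
// Only group_width, num_items and max_sg_sz are named in the commit
// message; every other name here is assumed.
struct ReduceConfigSketch {
  int group_width = 1;
  int group_height = 1;
  int num_items = 1;

  void set_group_dimension(int requested_width,
                           int requested_height,
                           int max_sg_sz) {
    group_width = requested_width;
    group_height = requested_height;
    num_items = group_width * group_height;

    if (num_items < max_sg_sz) {
      // Assumed policy: a small kernel is widened to one full sub-group.
      group_width = max_sg_sz;
      // The step the commit message describes: num_items must be
      // recomputed here, otherwise it still reflects the old group_width.
      num_items = group_width * group_height;
    }
  }
};
```

Under these assumptions, calling `set_group_dimension(4, 2, 16)` starts with 8 work-items, widens group_width to 16, and then recomputes num_items as 32. Without the second assignment, num_items would stay at 8 and disagree with the updated group_width, which is the kind of inconsistent parallelism configuration the commit message refers to.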