Commit 1f718eb
Enable bitsandbytes quantization on AMD GPUs that use warp size 32 (vllm-project#27307)
Signed-off-by: sstamenk <[email protected]>
Signed-off-by: Bhagyashri <[email protected]>1 parent 452fd2b commit 1f718eb
File tree
2 files changed
+10
-4
lines changed- tests/models/quantization
- vllm/platforms
2 files changed
+10
-4
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
14 | 14 | | |
15 | 15 | | |
16 | 16 | | |
17 | | - | |
18 | | - | |
19 | | - | |
20 | | - | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
21 | 24 | | |
22 | 25 | | |
23 | 26 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
185 | 185 | | |
186 | 186 | | |
187 | 187 | | |
| 188 | + | |
| 189 | + | |
| 190 | + | |
188 | 191 | | |
189 | 192 | | |
190 | 193 | | |
| |||
0 commit comments