Commit 2f01628
vulkan: Handle updated FA dim2/3 definition (ggml-org#14518)
* vulkan: Handle updated FA dim2/3 definition
Pack mask boolean and n_head_log2 into a single dword to keep the push
constant block under the 128B limit.
* handle null mask for gqa
* allow gqa with dim3>11 parent 344c1ce commit 2f01628
File tree
5 files changed
+26
-24
lines changed- ggml/src/ggml-vulkan
- vulkan-shaders
5 files changed
+26
-24
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
636 | 636 | | |
637 | 637 | | |
638 | 638 | | |
| 639 | + | |
639 | 640 | | |
640 | 641 | | |
641 | 642 | | |
| |||
651 | 652 | | |
652 | 653 | | |
653 | 654 | | |
654 | | - | |
655 | | - | |
| 655 | + | |
656 | 656 | | |
657 | 657 | | |
658 | 658 | | |
| |||
6114 | 6114 | | |
6115 | 6115 | | |
6116 | 6116 | | |
| 6117 | + | |
6117 | 6118 | | |
6118 | 6119 | | |
6119 | 6120 | | |
| |||
6181 | 6182 | | |
6182 | 6183 | | |
6183 | 6184 | | |
6184 | | - | |
| 6185 | + | |
6185 | 6186 | | |
6186 | 6187 | | |
6187 | 6188 | | |
| |||
6351 | 6352 | | |
6352 | 6353 | | |
6353 | 6354 | | |
| 6355 | + | |
| 6356 | + | |
6354 | 6357 | | |
6355 | 6358 | | |
6356 | 6359 | | |
6357 | 6360 | | |
6358 | 6361 | | |
6359 | | - | |
| 6362 | + | |
6360 | 6363 | | |
6361 | 6364 | | |
6362 | 6365 | | |
6363 | 6366 | | |
6364 | | - | |
| 6367 | + | |
6365 | 6368 | | |
6366 | 6369 | | |
6367 | 6370 | | |
| |||
10306 | 10309 | | |
10307 | 10310 | | |
10308 | 10311 | | |
10309 | | - | |
10310 | | - | |
10311 | | - | |
10312 | | - | |
10313 | | - | |
10314 | | - | |
10315 | 10312 | | |
10316 | 10313 | | |
10317 | 10314 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
101 | 101 | | |
102 | 102 | | |
103 | 103 | | |
104 | | - | |
105 | | - | |
| 104 | + | |
| 105 | + | |
106 | 106 | | |
107 | 107 | | |
108 | 108 | | |
| |||
149 | 149 | | |
150 | 150 | | |
151 | 151 | | |
152 | | - | |
| 152 | + | |
153 | 153 | | |
154 | 154 | | |
155 | 155 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
25 | 25 | | |
26 | 26 | | |
27 | 27 | | |
| 28 | + | |
28 | 29 | | |
29 | 30 | | |
30 | 31 | | |
| |||
40 | 41 | | |
41 | 42 | | |
42 | 43 | | |
43 | | - | |
44 | | - | |
| 44 | + | |
45 | 45 | | |
46 | 46 | | |
47 | 47 | | |
| |||
50 | 50 | | |
51 | 51 | | |
52 | 52 | | |
| 53 | + | |
| 54 | + | |
| 55 | + | |
53 | 56 | | |
54 | 57 | | |
55 | 58 | | |
| |||
100 | 103 | | |
101 | 104 | | |
102 | 105 | | |
103 | | - | |
104 | | - | |
| 106 | + | |
| 107 | + | |
| 108 | + | |
| 109 | + | |
105 | 110 | | |
106 | 111 | | |
107 | 112 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
126 | 126 | | |
127 | 127 | | |
128 | 128 | | |
129 | | - | |
130 | | - | |
| 129 | + | |
| 130 | + | |
131 | 131 | | |
132 | 132 | | |
133 | 133 | | |
| |||
182 | 182 | | |
183 | 183 | | |
184 | 184 | | |
185 | | - | |
| 185 | + | |
186 | 186 | | |
187 | 187 | | |
188 | 188 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
131 | 131 | | |
132 | 132 | | |
133 | 133 | | |
134 | | - | |
135 | | - | |
| 134 | + | |
| 135 | + | |
136 | 136 | | |
137 | 137 | | |
138 | 138 | | |
| |||
153 | 153 | | |
154 | 154 | | |
155 | 155 | | |
156 | | - | |
| 156 | + | |
157 | 157 | | |
158 | 158 | | |
159 | 159 | | |
| |||
0 commit comments