Retain alignment when lowering groupshared matrices #6589

pow2clk · 2024-05-06T17:19:08Z

When flattening the global for a groupshared matrix, the alignment information was getting lost. As a result, the alignments of the loads and stores were calculating their own alignment based on preferred alignment and trailing zeros of the index. The preferred alignment switched to 16 when the type size was over 128 bits due to a heuristic whose rationale is lost to time. When the global has its own alignment, that gets used, so by retaining it through lowering, the alignments are consistent and reliable.

Includes testing for a few matrix variants

fixes #6416

When flattening the global for a groupshared matrix, the alignment information was getting lost. As a result, the alignments of the loads and stores were calculating their own alignment based on preferred alignment and trailing zeros of the index. The preferred alignment switched to 16 when the type size was over 128 bits due to a heuristic whose rationale is lost to time. When the global has its own alignment, that gets used, so by retaining it through lowering, the alignments are consistent and reliable. Includes testing for a few matrix variants fixes microsoft#6416

...LSLFileCheck/hlsl/types/modifiers/groupshared/groupshared-member-matrix-subscript-align.hlsl

dmpots

There is a possibility of a perf regression with this change as it may prevent generating larger loads in some backends. Should we set the alignment to what it was assumed to be previously?

...LSLFileCheck/hlsl/types/modifiers/groupshared/groupshared-member-matrix-subscript-align.hlsl

Added verification that dimensions and variables are consistent

pow2clk · 2024-05-06T17:52:43Z

There is a possibility of a perf regression with this change as it may prevent generating larger loads in some backends. Should we set the alignment to what it was assumed to be previously?

It's certainly an option. What I have here preserves the alignment that is present from the start. I'm a bit more nervous about interjecting the alignment that the zero index happened to get when loads and stores were generated from the beginning. I fear that might have more unforeseen consequences than this does, but I can experiment with it.

To keep consistent with previous behavior, use the preferred alignment calculation when assigning alignment to the matrix groupshared instead of preserving the original value.

Add half and double variants for test cases Add pass test for hlmatrixlower

dmpots · 2024-05-21T18:10:16Z

There is a possibility of a perf regression with this change as it may prevent generating larger loads in some backends. Should we set the alignment to what it was assumed to be previously?

It's certainly an option. What I have here preserves the alignment that is present from the start. I'm a bit more nervous about interjecting the alignment that the zero index happened to get when loads and stores were generated from the beginning. I fear that might have more unforeseen consequences than this does, but I can experiment with it.

In the updated PR we are using the datalayout to compute the preferred alignment. This looks like the correct way to do it to me.

…rosoft#6589) When flattening the global for a groupshared matrix, the alignment information was getting lost. As a result, the alignments of the loads and stores were calculating their own alignment based on preferred alignment and trailing zeros of the index. The preferred alignment switched to 16 when the type size was over 128 bits due to a heuristic whose rationale is lost to time. When the global has its own alignment, that gets used, so by calculating it at lowering, the alignments are consistent and reliable. Includes testing for a few matrix variants and a pass test. fixes microsoft#6416 (cherry picked from commit a6f4025)

tex3d · 2024-05-22T20:42:13Z

...LSLFileCheck/hlsl/types/modifiers/groupshared/groupshared-member-matrix-subscript-align.hlsl

+// RUN: %dxc -DTYPE=float1x4 /Tcs_6_0 %s | FileCheck %s -check-prefixes=CHECK,CHECK4
+// RUN: %dxc -DTYPE=float2x2 /Tcs_6_0 %s | FileCheck %s -check-prefixes=CHECK,CHECK4
+// RUN: %dxc -DTYPE=double4x4 /Tcs_6_0 %s | FileCheck %s -check-prefixes=CHECK,LCHECK,CHECK8,LCHECKF
+// RUN: %dxc -DTYPE=float16_t4x4 /Tcs_6_2 %s -enable-16bit-types | FileCheck %s -check-prefixes=CHECK,LCHECK,CHECK2,LCHECKF


I think it might be important to also test with a matrix array, non-zero index, and odd matrix sizes, like (3x3, 1x3, 3x1). Since the alignment of the matrix is considered to be the alignment of the element, different array elements for a matrix whose size isn't an even multiple of 16 bytes won't be aligned by that amount.

…rosoft#6589) When flattening the global for a groupshared matrix, the alignment information was getting lost. As a result, the alignments of the loads and stores were calculating their own alignment based on preferred alignment and trailing zeros of the index. The preferred alignment switched to 16 when the type size was over 128 bits due to a heuristic whose rationale is lost to time. When the global has its own alignment, that gets used, so by calculating it at lowering, the alignments are consistent and reliable. Includes testing for a few matrix variants and a pass test. fixes microsoft#6416

pow2clk requested a review from dmpots May 6, 2024 17:19

pow2clk requested a review from a team as a code owner May 6, 2024 17:19

llvm-beanz reviewed May 6, 2024

View reviewed changes

...LSLFileCheck/hlsl/types/modifiers/groupshared/groupshared-member-matrix-subscript-align.hlsl Show resolved Hide resolved

dmpots reviewed May 6, 2024

View reviewed changes

...LSLFileCheck/hlsl/types/modifiers/groupshared/groupshared-member-matrix-subscript-align.hlsl Outdated Show resolved Hide resolved

constrain tests just a bit more

d2b15bd

Added verification that dimensions and variables are consistent

Greg Roth added 3 commits May 6, 2024 13:42

change how default groupshared alignment is calculated

7f44666

To keep consistent with previous behavior, use the preferred alignment calculation when assigning alignment to the matrix groupshared instead of preserving the original value.

Add more matrix element type tests

c65c66a

Add half and double variants for test cases Add pass test for hlmatrixlower

update existing matrix test for new alignment

7cde004

dmpots approved these changes May 21, 2024

View reviewed changes

llvm-beanz approved these changes May 22, 2024

View reviewed changes

pow2clk merged commit a6f4025 into microsoft:main May 22, 2024

pow2clk deleted the gs_mat_ldst branch May 22, 2024 20:38

tex3d reviewed May 22, 2024

View reviewed changes

pow2clk mentioned this pull request May 22, 2024

Calculate preferred alignment when lowering groupshared matrices #6645

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Retain alignment when lowering groupshared matrices #6589

Retain alignment when lowering groupshared matrices #6589

Uh oh!

pow2clk commented May 6, 2024

Uh oh!

Uh oh!

dmpots left a comment

Uh oh!

Uh oh!

pow2clk commented May 6, 2024

Uh oh!

dmpots commented May 21, 2024

Uh oh!

tex3d May 22, 2024

Uh oh!

Uh oh!

Retain alignment when lowering groupshared matrices #6589

Retain alignment when lowering groupshared matrices #6589

Uh oh!

Conversation

pow2clk commented May 6, 2024

Uh oh!

Uh oh!

dmpots left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

pow2clk commented May 6, 2024

Uh oh!

dmpots commented May 21, 2024

Uh oh!

tex3d May 22, 2024

Choose a reason for hiding this comment

Uh oh!

Uh oh!