[Codegen][llvmgpu] Compute gemmC size when C promotion is done in padding matmul #19307

jerryyin · 2024-11-26T21:23:09Z

This PR depends on #19271. Please review the last commit only.

The existing implementation in #19271 does not take gemmC into account when computing shared memory size. Under the conditions of #19271, however, gemmC is always promoted, so the C tensor always ends up allocated in shared memory. Ignoring the C tensor severely underestimates the amount of shared memory used, which can cause deduceMMASchedule() to pick a tile size that is too large and exceeds the shared memory limit.

6c9ffd2 addresses this by adding calculateResultSharedMemoryUsedInBytes() and applying it when deriving the matmul tile size.
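To illustrate why the C tile matters, here is a minimal, self-contained sketch of the accounting. It is not the code in this PR: `TileShape`, `operandsSharedMemoryBytes`, `resultSharedMemoryBytes`, and `fitsInSharedMemory` are hypothetical names, and the model assumes one M x K tile of A, one K x N tile of B, and (when promoted) one M x N tile of C per workgroup.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical, simplified workgroup tile description (not IREE's schedule type).
struct TileShape {
  int64_t m; // tile rows (M)
  int64_t n; // tile columns (N)
  int64_t k; // reduction depth (K)
};

// Bytes needed for the A (m x k) and B (k x n) tiles.
int64_t operandsSharedMemoryBytes(const TileShape &t, int64_t lhsBits,
                                  int64_t rhsBits) {
  assert(lhsBits > 0 && rhsBits > 0 && "element bit widths must be positive");
  return (t.m * t.k * lhsBits + t.k * t.n * rhsBits) / 8;
}

// Bytes needed for the C (m x n) tile when it is promoted to shared memory.
int64_t resultSharedMemoryBytes(const TileShape &t, int64_t resultBits) {
  assert(resultBits > 0 && "element bit width must be positive");
  return (t.m * t.n * resultBits) / 8;
}

// Returns true if the tile fits in the shared memory budget. The C tile is
// only counted when promoteC is set, mirroring the case described above.
bool fitsInSharedMemory(const TileShape &t, int64_t lhsBits, int64_t rhsBits,
                        int64_t resultBits, bool promoteC, int64_t limitBytes) {
  int64_t used = operandsSharedMemoryBytes(t, lhsBits, rhsBits);
  if (promoteC)
    used += resultSharedMemoryBytes(t, resultBits);
  return used <= limitBytes;
}
```

Under these assumptions, a 128x128x64 tile with f16 operands and an f32 result needs 32768 bytes for A and B and another 65536 bytes for C. Against an assumed 64 KiB budget, the tile looks valid when C is ignored and is rejected once C is counted.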

Signed-off-by: Nirvedh <[email protected]>

- This is a followup of iree-org#19271
- This takes gemmC promotion into consideration in computing shared memory consumption

Signed-off-by: jerryyin <[email protected]>

nirvedhmeshram and others added 3 commits November 22, 2024 14:07

Add c promotion capability to promote matmul operands pass

7486844

Signed-off-by: Nirvedh <[email protected]>

[GPU] Add config and pass to do padding

1586d53

Signed-off-by: Nirvedh <[email protected]>

Compute gemmC size when C promotion is done in padding matmul

6c9ffd2

- This is a followup of iree-org#19271
- This takes gemmC promotion into consideration in computing shared memory consumption

Signed-off-by: jerryyin <[email protected]>

jerryyin requested review from MaheshRavishankar, qedawkins, kuhar, Groverkss and antiagainst as code owners November 26, 2024 21:23

jerryyin changed the title ~~Compute gemmC size when C promotion is done in padding matmul~~ [Codegen][llvmgpu] Compute gemmC size when C promotion is done in padding matmul Nov 26, 2024
