Scale P and Q with L2 cache size for SVE #4397

Mousius · 2023-12-27T18:38:04Z

The defaults in param.h now reflect an L2 size of 128KB, and that is scaled based on the actual size.

Mousius · 2023-12-27T18:39:08Z

@martin-frbg , this is closer to what I was thinking previously, what do you think? I can see others have done similar in setparam-ref.c but I'm assuming that doesn't work when building for a specific core outside of DYNAMIC_ARCH?

martin-frbg · 2023-12-27T19:18:39Z

yes, for a specific cpu TARGET build I think the factor would have to be applied in common_param.h but I have limited brain capacity for that right now

Mousius · 2023-12-29T18:38:05Z

Thanks @martin-frbg, I'll look into it 😸 !

DhanusML · 2024-01-19T10:14:43Z

param.h

@@ -3517,13 +3517,13 @@ Until then, just keep it different than DGEMM_DEFAULT_UNROLL_N to keep copy rout
 #define ZGEMM_DEFAULT_UNROLL_N  4
 #define ZGEMM_DEFAULT_UNROLL_MN  16

-#define SGEMM_DEFAULT_P 128
-#define DGEMM_DEFAULT_P 160
+#define SGEMM_DEFAULT_P 30


How were the default P and Q chosen for 128KB cache?

#4381 demonstrated values that worked well for a 1MB L2 cache, so I divided that by 8.

If you have a more scientific approach, I'd be happy to hear it 😸

Scale P and Q with L2 cache size for SVE

75fe9c2

The defaults in param.h now reflect an L2 size of 128KB, and that is scaled based on the actual size.

Mousius mentioned this pull request Jan 16, 2024

ARMV8SVE dgemm kernel on Nvidia Grace slightly slower than generic ARMV8 kernel #4440

Closed

DhanusML reviewed Jan 19, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Scale P and Q with L2 cache size for SVE #4397

Scale P and Q with L2 cache size for SVE #4397

Mousius commented Dec 27, 2023

Mousius commented Dec 27, 2023

martin-frbg commented Dec 27, 2023

Mousius commented Dec 29, 2023

DhanusML Jan 19, 2024

Mousius Jan 19, 2024

Scale P and Q with L2 cache size for SVE #4397

Are you sure you want to change the base?

Scale P and Q with L2 cache size for SVE #4397

Conversation

Mousius commented Dec 27, 2023

Mousius commented Dec 27, 2023

martin-frbg commented Dec 27, 2023

Mousius commented Dec 29, 2023

DhanusML Jan 19, 2024

Choose a reason for hiding this comment

Mousius Jan 19, 2024

Choose a reason for hiding this comment