snax-mlir

Dense Matmul Benchmark Results

This test was run at 03/10/2024 16:25:52

Results for a cyclic layout

benchmark layout M N K cycles ideal utilization
dense_matmul_cyclic_32x32x32 cyclic 32 32 32 93 80 0.8602150537634409
dense_matmul_cyclic_48x32x32 cyclic 32 48 32 133 120 0.9022556390977443
dense_matmul_cyclic_64x32x32 cyclic 32 64 32 173 160 0.9248554913294798
dense_matmul_cyclic_32x48x32 cyclic 48 32 32 133 120 0.9022556390977443
dense_matmul_cyclic_48x48x32 cyclic 48 48 32 193 180 0.9326424870466321
dense_matmul_cyclic_64x48x32 cyclic 48 64 32 259 240 0.9266409266409267
dense_matmul_cyclic_32x64x32 cyclic 64 32 32 173 160 0.9248554913294798
dense_matmul_cyclic_48x64x32 cyclic 64 48 32 253 240 0.9486166007905138
dense_matmul_cyclic_64x64x32 cyclic 64 64 32 333 320 0.960960960960961
dense_matmul_cyclic_32x32x48 cyclic 32 32 48 136 112 0.8235294117647058
dense_matmul_cyclic_48x32x48 cyclic 32 48 48 198 168 0.8484848484848485
dense_matmul_cyclic_64x32x48 cyclic 32 64 48 260 224 0.8615384615384616
dense_matmul_cyclic_32x48x48 cyclic 48 32 48 200 168 0.84
dense_matmul_cyclic_48x48x48 cyclic 48 48 48 294 252 0.8571428571428571
dense_matmul_cyclic_64x48x48 cyclic 48 64 48 388 336 0.865979381443299
dense_matmul_cyclic_32x64x48 cyclic 64 32 48 264 224 0.8484848484848485
dense_matmul_cyclic_48x64x48 cyclic 64 48 48 390 336 0.8615384615384616
dense_matmul_cyclic_64x64x48 cyclic 64 64 48 516 448 0.8682170542635659
dense_matmul_cyclic_32x32x64 cyclic 32 32 64 157 144 0.9171974522292994
dense_matmul_cyclic_48x32x64 cyclic 32 48 64 229 216 0.9432314410480349
dense_matmul_cyclic_64x32x64 cyclic 32 64 64 301 288 0.9568106312292359
dense_matmul_cyclic_32x48x64 cyclic 48 32 64 231 216 0.935064935064935
dense_matmul_cyclic_48x48x64 cyclic 48 48 64 337 324 0.9614243323442137
dense_matmul_cyclic_64x48x64 cyclic 48 64 64 445 432 0.9707865168539326
dense_matmul_cyclic_32x64x64 cyclic 64 32 64 301 288 0.9568106312292359
dense_matmul_cyclic_48x64x64 cyclic 64 48 64 447 432 0.9664429530201343
dense_matmul_cyclic_64x64x64 cyclic 64 64 64 591 576 0.9746192893401016
average     0.9089111776695221        

Results for a banked layout

benchmark layout M N K cycles ideal utilization
dense_matmul_banked_32x32x32 banked 32 32 32 92 80 0.8695652173913043
dense_matmul_banked_48x32x32 banked 32 48 32 132 120 0.9090909090909091
dense_matmul_banked_64x32x32 banked 32 64 32 172 160 0.9302325581395349
dense_matmul_banked_32x48x32 banked 48 32 32 132 120 0.9090909090909091
dense_matmul_banked_48x48x32 banked 48 48 32 192 180 0.9375
dense_matmul_banked_64x48x32 banked 48 64 32 258 240 0.9302325581395349
dense_matmul_banked_32x64x32 banked 64 32 32 175 160 0.9142857142857143
dense_matmul_banked_48x64x32 banked 64 48 32 252 240 0.9523809523809523
dense_matmul_banked_64x64x32 banked 64 64 32 332 320 0.963855421686747
dense_matmul_banked_32x32x48 banked 32 32 48 124 112 0.9032258064516129
dense_matmul_banked_48x32x48 banked 32 48 48 186 168 0.9032258064516129
dense_matmul_banked_64x32x48 banked 32 64 48 236 224 0.9491525423728814
dense_matmul_banked_32x48x48 banked 48 32 48 180 168 0.9333333333333333
dense_matmul_banked_48x48x48 banked 48 48 48 264 252 0.9545454545454546
dense_matmul_banked_64x48x48 banked 48 64 48 351 336 0.9572649572649573
dense_matmul_banked_32x64x48 banked 64 32 48 236 224 0.9491525423728814
dense_matmul_banked_48x64x48 banked 64 48 48 348 336 0.9655172413793104
dense_matmul_banked_64x64x48 banked 64 64 48 460 448 0.9739130434782609
dense_matmul_banked_32x32x64 banked 32 32 64 156 144 0.9230769230769231
dense_matmul_banked_48x32x64 banked 32 48 64 228 216 0.9473684210526315
dense_matmul_banked_64x32x64 banked 32 64 64 306 288 0.9411764705882353
dense_matmul_banked_32x48x64 banked 48 32 64 228 216 0.9473684210526315
dense_matmul_banked_48x48x64 banked 48 48 64 336 324 0.9642857142857143
dense_matmul_banked_64x48x64 banked 48 64 64 444 432 0.972972972972973
dense_matmul_banked_32x64x64 banked 64 32 64 300 288 0.96
dense_matmul_banked_48x64x64 banked 64 48 64 444 432 0.972972972972973
dense_matmul_banked_64x64x64 banked 64 64 64 588 576 0.9795918367346939
average     0.9412732852071364