This test was run at 03/10/2024 16:25:52
Results for a cyclic layout
benchmark | layout | M | N | K | cycles | ideal | utilization |
---|---|---|---|---|---|---|---|
dense_matmul_cyclic_32x32x32 | cyclic | 32 | 32 | 32 | 93 | 80 | 0.8602150537634409 |
dense_matmul_cyclic_48x32x32 | cyclic | 32 | 48 | 32 | 133 | 120 | 0.9022556390977443 |
dense_matmul_cyclic_64x32x32 | cyclic | 32 | 64 | 32 | 173 | 160 | 0.9248554913294798 |
dense_matmul_cyclic_32x48x32 | cyclic | 48 | 32 | 32 | 133 | 120 | 0.9022556390977443 |
dense_matmul_cyclic_48x48x32 | cyclic | 48 | 48 | 32 | 193 | 180 | 0.9326424870466321 |
dense_matmul_cyclic_64x48x32 | cyclic | 48 | 64 | 32 | 259 | 240 | 0.9266409266409267 |
dense_matmul_cyclic_32x64x32 | cyclic | 64 | 32 | 32 | 173 | 160 | 0.9248554913294798 |
dense_matmul_cyclic_48x64x32 | cyclic | 64 | 48 | 32 | 253 | 240 | 0.9486166007905138 |
dense_matmul_cyclic_64x64x32 | cyclic | 64 | 64 | 32 | 333 | 320 | 0.960960960960961 |
dense_matmul_cyclic_32x32x48 | cyclic | 32 | 32 | 48 | 136 | 112 | 0.8235294117647058 |
dense_matmul_cyclic_48x32x48 | cyclic | 32 | 48 | 48 | 198 | 168 | 0.8484848484848485 |
dense_matmul_cyclic_64x32x48 | cyclic | 32 | 64 | 48 | 260 | 224 | 0.8615384615384616 |
dense_matmul_cyclic_32x48x48 | cyclic | 48 | 32 | 48 | 200 | 168 | 0.84 |
dense_matmul_cyclic_48x48x48 | cyclic | 48 | 48 | 48 | 294 | 252 | 0.8571428571428571 |
dense_matmul_cyclic_64x48x48 | cyclic | 48 | 64 | 48 | 388 | 336 | 0.865979381443299 |
dense_matmul_cyclic_32x64x48 | cyclic | 64 | 32 | 48 | 264 | 224 | 0.8484848484848485 |
dense_matmul_cyclic_48x64x48 | cyclic | 64 | 48 | 48 | 390 | 336 | 0.8615384615384616 |
dense_matmul_cyclic_64x64x48 | cyclic | 64 | 64 | 48 | 516 | 448 | 0.8682170542635659 |
dense_matmul_cyclic_32x32x64 | cyclic | 32 | 32 | 64 | 157 | 144 | 0.9171974522292994 |
dense_matmul_cyclic_48x32x64 | cyclic | 32 | 48 | 64 | 229 | 216 | 0.9432314410480349 |
dense_matmul_cyclic_64x32x64 | cyclic | 32 | 64 | 64 | 301 | 288 | 0.9568106312292359 |
dense_matmul_cyclic_32x48x64 | cyclic | 48 | 32 | 64 | 231 | 216 | 0.935064935064935 |
dense_matmul_cyclic_48x48x64 | cyclic | 48 | 48 | 64 | 337 | 324 | 0.9614243323442137 |
dense_matmul_cyclic_64x48x64 | cyclic | 48 | 64 | 64 | 445 | 432 | 0.9707865168539326 |
dense_matmul_cyclic_32x64x64 | cyclic | 64 | 32 | 64 | 301 | 288 | 0.9568106312292359 |
dense_matmul_cyclic_48x64x64 | cyclic | 64 | 48 | 64 | 447 | 432 | 0.9664429530201343 |
dense_matmul_cyclic_64x64x64 | cyclic | 64 | 64 | 64 | 591 | 576 | 0.9746192893401016 |
average | 0.9089111776695221 |
Results for a banked layout
benchmark | layout | M | N | K | cycles | ideal | utilization |
---|---|---|---|---|---|---|---|
dense_matmul_banked_32x32x32 | banked | 32 | 32 | 32 | 92 | 80 | 0.8695652173913043 |
dense_matmul_banked_48x32x32 | banked | 32 | 48 | 32 | 132 | 120 | 0.9090909090909091 |
dense_matmul_banked_64x32x32 | banked | 32 | 64 | 32 | 172 | 160 | 0.9302325581395349 |
dense_matmul_banked_32x48x32 | banked | 48 | 32 | 32 | 132 | 120 | 0.9090909090909091 |
dense_matmul_banked_48x48x32 | banked | 48 | 48 | 32 | 192 | 180 | 0.9375 |
dense_matmul_banked_64x48x32 | banked | 48 | 64 | 32 | 258 | 240 | 0.9302325581395349 |
dense_matmul_banked_32x64x32 | banked | 64 | 32 | 32 | 175 | 160 | 0.9142857142857143 |
dense_matmul_banked_48x64x32 | banked | 64 | 48 | 32 | 252 | 240 | 0.9523809523809523 |
dense_matmul_banked_64x64x32 | banked | 64 | 64 | 32 | 332 | 320 | 0.963855421686747 |
dense_matmul_banked_32x32x48 | banked | 32 | 32 | 48 | 124 | 112 | 0.9032258064516129 |
dense_matmul_banked_48x32x48 | banked | 32 | 48 | 48 | 186 | 168 | 0.9032258064516129 |
dense_matmul_banked_64x32x48 | banked | 32 | 64 | 48 | 236 | 224 | 0.9491525423728814 |
dense_matmul_banked_32x48x48 | banked | 48 | 32 | 48 | 180 | 168 | 0.9333333333333333 |
dense_matmul_banked_48x48x48 | banked | 48 | 48 | 48 | 264 | 252 | 0.9545454545454546 |
dense_matmul_banked_64x48x48 | banked | 48 | 64 | 48 | 351 | 336 | 0.9572649572649573 |
dense_matmul_banked_32x64x48 | banked | 64 | 32 | 48 | 236 | 224 | 0.9491525423728814 |
dense_matmul_banked_48x64x48 | banked | 64 | 48 | 48 | 348 | 336 | 0.9655172413793104 |
dense_matmul_banked_64x64x48 | banked | 64 | 64 | 48 | 460 | 448 | 0.9739130434782609 |
dense_matmul_banked_32x32x64 | banked | 32 | 32 | 64 | 156 | 144 | 0.9230769230769231 |
dense_matmul_banked_48x32x64 | banked | 32 | 48 | 64 | 228 | 216 | 0.9473684210526315 |
dense_matmul_banked_64x32x64 | banked | 32 | 64 | 64 | 306 | 288 | 0.9411764705882353 |
dense_matmul_banked_32x48x64 | banked | 48 | 32 | 64 | 228 | 216 | 0.9473684210526315 |
dense_matmul_banked_48x48x64 | banked | 48 | 48 | 64 | 336 | 324 | 0.9642857142857143 |
dense_matmul_banked_64x48x64 | banked | 48 | 64 | 64 | 444 | 432 | 0.972972972972973 |
dense_matmul_banked_32x64x64 | banked | 64 | 32 | 64 | 300 | 288 | 0.96 |
dense_matmul_banked_48x64x64 | banked | 64 | 48 | 64 | 444 | 432 | 0.972972972972973 |
dense_matmul_banked_64x64x64 | banked | 64 | 64 | 64 | 588 | 576 | 0.9795918367346939 |
average | 0.9412732852071364 |