This test was run at 07/02/2025 13:43:40
Results for a cyclic layout with add C
benchmark | layout | add C | M | N | K | plots | cycles | ideal | utilization |
---|---|---|---|---|---|---|---|---|---|
dense_gemm_cyclic_32x32x32 | cyclic | yes | 32 | 32 | 32 | yes | 117 | 80 | 0.6837606837606838 |
dense_gemm_cyclic_32x32x48 | cyclic | yes | 32 | 32 | 48 | yes | 153 | 112 | 0.7320261437908496 |
dense_gemm_cyclic_32x32x64 | cyclic | yes | 32 | 32 | 64 | no | 192 | 144 | 0.75 |
dense_gemm_cyclic_32x48x32 | cyclic | yes | 32 | 48 | 32 | yes | 173 | 120 | 0.6936416184971098 |
dense_gemm_cyclic_32x48x48 | cyclic | yes | 32 | 48 | 48 | yes | 223 | 168 | 0.7533632286995515 |
dense_gemm_cyclic_32x48x64 | cyclic | yes | 32 | 48 | 64 | no | 288 | 216 | 0.75 |
dense_gemm_cyclic_32x64x32 | cyclic | yes | 32 | 64 | 32 | no | 229 | 160 | 0.6986899563318777 |
dense_gemm_cyclic_32x64x48 | cyclic | yes | 32 | 64 | 48 | no | 293 | 224 | 0.764505119453925 |
dense_gemm_cyclic_32x64x64 | cyclic | yes | 32 | 64 | 64 | no | 384 | 288 | 0.75 |
dense_gemm_cyclic_48x32x32 | cyclic | yes | 48 | 32 | 32 | yes | 173 | 120 | 0.6936416184971098 |
dense_gemm_cyclic_48x32x48 | cyclic | yes | 48 | 32 | 48 | yes | 223 | 168 | 0.7533632286995515 |
dense_gemm_cyclic_48x32x64 | cyclic | yes | 48 | 32 | 64 | no | 288 | 216 | 0.75 |
dense_gemm_cyclic_48x48x32 | cyclic | yes | 48 | 48 | 32 | yes | 257 | 180 | 0.7003891050583657 |
dense_gemm_cyclic_48x48x48 | cyclic | yes | 48 | 48 | 48 | yes | 328 | 252 | 0.7682926829268293 |
dense_gemm_cyclic_48x48x64 | cyclic | yes | 48 | 48 | 64 | no | 432 | 324 | 0.75 |
dense_gemm_cyclic_48x64x32 | cyclic | yes | 48 | 64 | 32 | no | 341 | 240 | 0.7038123167155426 |
dense_gemm_cyclic_48x64x48 | cyclic | yes | 48 | 64 | 48 | no | 433 | 336 | 0.7759815242494227 |
dense_gemm_cyclic_48x64x64 | cyclic | yes | 48 | 64 | 64 | no | 576 | 432 | 0.75 |
dense_gemm_cyclic_64x32x32 | cyclic | yes | 64 | 32 | 32 | no | 229 | 160 | 0.6986899563318777 |
dense_gemm_cyclic_64x32x48 | cyclic | yes | 64 | 32 | 48 | no | 293 | 224 | 0.764505119453925 |
dense_gemm_cyclic_64x32x64 | cyclic | yes | 64 | 32 | 64 | no | 384 | 288 | 0.75 |
dense_gemm_cyclic_64x48x32 | cyclic | yes | 64 | 48 | 32 | no | 341 | 240 | 0.7038123167155426 |
dense_gemm_cyclic_64x48x48 | cyclic | yes | 64 | 48 | 48 | no | 433 | 336 | 0.7759815242494227 |
dense_gemm_cyclic_64x48x64 | cyclic | yes | 64 | 48 | 64 | no | 576 | 432 | 0.75 |
dense_gemm_cyclic_64x64x32 | cyclic | yes | 64 | 64 | 32 | no | 453 | 320 | 0.7064017660044151 |
dense_gemm_cyclic_64x64x48 | cyclic | yes | 64 | 64 | 48 | no | 573 | 448 | 0.7818499127399651 |
dense_gemm_cyclic_64x64x64 | cyclic | yes | 64 | 64 | 64 | no | 768 | 576 | 0.75 |
average | 0.737137326747258 |
Results for a cyclic layout
benchmark | layout | add C | M | N | K | plots | cycles | ideal | utilization |
---|---|---|---|---|---|---|---|---|---|
dense_matmul_cyclic_32x32x32 | cyclic | no | 32 | 32 | 32 | yes | 95 | 80 | 0.8421052631578947 |
dense_matmul_cyclic_32x32x48 | cyclic | no | 32 | 32 | 48 | yes | 128 | 112 | 0.875 |
dense_matmul_cyclic_32x32x64 | cyclic | no | 32 | 32 | 64 | no | 164 | 144 | 0.8780487804878049 |
dense_matmul_cyclic_32x48x32 | cyclic | no | 32 | 48 | 32 | yes | 139 | 120 | 0.8633093525179856 |
dense_matmul_cyclic_32x48x48 | cyclic | no | 32 | 48 | 48 | yes | 188 | 168 | 0.8936170212765957 |
dense_matmul_cyclic_32x48x64 | cyclic | no | 32 | 48 | 64 | no | 244 | 216 | 0.8852459016393442 |
dense_matmul_cyclic_32x64x32 | cyclic | no | 32 | 64 | 32 | no | 183 | 160 | 0.8743169398907104 |
dense_matmul_cyclic_32x64x48 | cyclic | no | 32 | 64 | 48 | no | 248 | 224 | 0.9032258064516129 |
dense_matmul_cyclic_32x64x64 | cyclic | no | 32 | 64 | 64 | no | 324 | 288 | 0.8888888888888888 |
dense_matmul_cyclic_48x32x32 | cyclic | no | 48 | 32 | 32 | yes | 139 | 120 | 0.8633093525179856 |
dense_matmul_cyclic_48x32x48 | cyclic | no | 48 | 32 | 48 | yes | 190 | 168 | 0.8842105263157894 |
dense_matmul_cyclic_48x32x64 | cyclic | no | 48 | 32 | 64 | no | 244 | 216 | 0.8852459016393442 |
dense_matmul_cyclic_48x48x32 | cyclic | no | 48 | 48 | 32 | yes | 205 | 180 | 0.8780487804878049 |
dense_matmul_cyclic_48x48x48 | cyclic | no | 48 | 48 | 48 | yes | 280 | 252 | 0.9 |
dense_matmul_cyclic_48x48x64 | cyclic | no | 48 | 48 | 64 | no | 364 | 324 | 0.8901098901098901 |
dense_matmul_cyclic_48x64x32 | cyclic | no | 48 | 64 | 32 | no | 271 | 240 | 0.8856088560885609 |
dense_matmul_cyclic_48x64x48 | cyclic | no | 48 | 64 | 48 | no | 370 | 336 | 0.9081081081081082 |
dense_matmul_cyclic_48x64x64 | cyclic | no | 48 | 64 | 64 | no | 484 | 432 | 0.8925619834710744 |
dense_matmul_cyclic_64x32x32 | cyclic | no | 64 | 32 | 32 | no | 183 | 160 | 0.8743169398907104 |
dense_matmul_cyclic_64x32x48 | cyclic | no | 64 | 32 | 48 | no | 249 | 224 | 0.8995983935742972 |
dense_matmul_cyclic_64x32x64 | cyclic | no | 64 | 32 | 64 | no | 324 | 288 | 0.8888888888888888 |
dense_matmul_cyclic_64x48x32 | cyclic | no | 64 | 48 | 32 | no | 271 | 240 | 0.8856088560885609 |
dense_matmul_cyclic_64x48x48 | cyclic | no | 64 | 48 | 48 | no | 369 | 336 | 0.9105691056910569 |
dense_matmul_cyclic_64x48x64 | cyclic | no | 64 | 48 | 64 | no | 484 | 432 | 0.8925619834710744 |
dense_matmul_cyclic_64x64x32 | cyclic | no | 64 | 64 | 32 | no | 359 | 320 | 0.8913649025069638 |
dense_matmul_cyclic_64x64x48 | cyclic | no | 64 | 64 | 48 | no | 489 | 448 | 0.9161554192229039 |
dense_matmul_cyclic_64x64x64 | cyclic | no | 64 | 64 | 64 | no | 644 | 576 | 0.8944099378881988 |
average | 0.8868309548248908 |