This test was run at 10/07/2025 15:17:04
Results for a cyclic layout with add C
benchmark | layout | add C | M | N | K | plots | cycles | ideal | utilization |
---|---|---|---|---|---|---|---|---|---|
dense_gemm_cyclic_32x32x32 | cyclic | yes | 32 | 32 | 32 | yes | 116 | 80 | 0.6896551724137931 |
dense_gemm_cyclic_32x32x48 | cyclic | yes | 32 | 32 | 48 | yes | 157 | 112 | 0.7133757961783439 |
dense_gemm_cyclic_32x32x64 | cyclic | yes | 32 | 32 | 64 | no | 183 | 144 | 0.7868852459016393 |
dense_gemm_cyclic_32x48x32 | cyclic | yes | 32 | 48 | 32 | yes | 172 | 120 | 0.6976744186046512 |
dense_gemm_cyclic_32x48x48 | cyclic | yes | 32 | 48 | 48 | yes | 232 | 168 | 0.7241379310344828 |
dense_gemm_cyclic_32x48x64 | cyclic | yes | 32 | 48 | 64 | no | 271 | 216 | 0.7970479704797048 |
dense_gemm_cyclic_32x64x32 | cyclic | yes | 32 | 64 | 32 | no | 228 | 160 | 0.7017543859649122 |
dense_gemm_cyclic_32x64x48 | cyclic | yes | 32 | 64 | 48 | no | 307 | 224 | 0.7296416938110749 |
dense_gemm_cyclic_32x64x64 | cyclic | yes | 32 | 64 | 64 | no | 359 | 288 | 0.8022284122562674 |
dense_gemm_cyclic_48x32x32 | cyclic | yes | 48 | 32 | 32 | yes | 172 | 120 | 0.6976744186046512 |
dense_gemm_cyclic_48x32x48 | cyclic | yes | 48 | 32 | 48 | yes | 232 | 168 | 0.7241379310344828 |
dense_gemm_cyclic_48x32x64 | cyclic | yes | 48 | 32 | 64 | no | 271 | 216 | 0.7970479704797048 |
dense_gemm_cyclic_48x48x32 | cyclic | yes | 48 | 48 | 32 | yes | 256 | 180 | 0.703125 |
dense_gemm_cyclic_48x48x48 | cyclic | yes | 48 | 48 | 48 | yes | 345 | 252 | 0.7304347826086957 |
dense_gemm_cyclic_48x48x64 | cyclic | yes | 48 | 48 | 64 | no | 403 | 324 | 0.8039702233250621 |
dense_gemm_cyclic_48x64x32 | cyclic | yes | 48 | 64 | 32 | no | 340 | 240 | 0.7058823529411765 |
dense_gemm_cyclic_48x64x48 | cyclic | yes | 48 | 64 | 48 | no | 458 | 336 | 0.7336244541484717 |
dense_gemm_cyclic_48x64x64 | cyclic | yes | 48 | 64 | 64 | no | 535 | 432 | 0.8074766355140187 |
dense_gemm_cyclic_64x32x32 | cyclic | yes | 64 | 32 | 32 | no | 228 | 160 | 0.7017543859649122 |
dense_gemm_cyclic_64x32x48 | cyclic | yes | 64 | 32 | 48 | no | 308 | 224 | 0.7272727272727273 |
dense_gemm_cyclic_64x32x64 | cyclic | yes | 64 | 32 | 64 | no | 359 | 288 | 0.8022284122562674 |
dense_gemm_cyclic_64x48x32 | cyclic | yes | 64 | 48 | 32 | no | 340 | 240 | 0.7058823529411765 |
dense_gemm_cyclic_64x48x48 | cyclic | yes | 64 | 48 | 48 | no | 459 | 336 | 0.7320261437908496 |
dense_gemm_cyclic_64x48x64 | cyclic | yes | 64 | 48 | 64 | no | 535 | 432 | 0.8074766355140187 |
dense_gemm_cyclic_64x64x32 | cyclic | yes | 64 | 64 | 32 | no | 452 | 320 | 0.7079646017699115 |
dense_gemm_cyclic_64x64x48 | cyclic | yes | 64 | 64 | 48 | no | 610 | 448 | 0.7344262295081967 |
dense_gemm_cyclic_64x64x64 | cyclic | yes | 64 | 64 | 64 | no | 711 | 576 | 0.810126582278481 |
average | 0.7435160320962104 |
Results for a cyclic layout
benchmark | layout | add C | M | N | K | plots | cycles | ideal | utilization |
---|---|---|---|---|---|---|---|---|---|
dense_matmul_cyclic_32x32x32 | cyclic | no | 32 | 32 | 32 | yes | 93 | 80 | 0.8602150537634409 |
dense_matmul_cyclic_32x32x48 | cyclic | no | 32 | 32 | 48 | yes | 128 | 112 | 0.875 |
dense_matmul_cyclic_32x32x64 | cyclic | no | 32 | 32 | 64 | no | 164 | 144 | 0.8780487804878049 |
dense_matmul_cyclic_32x48x32 | cyclic | no | 32 | 48 | 32 | yes | 137 | 120 | 0.8759124087591241 |
dense_matmul_cyclic_32x48x48 | cyclic | no | 32 | 48 | 48 | yes | 188 | 168 | 0.8936170212765957 |
dense_matmul_cyclic_32x48x64 | cyclic | no | 32 | 48 | 64 | no | 244 | 216 | 0.8852459016393442 |
dense_matmul_cyclic_32x64x32 | cyclic | no | 32 | 64 | 32 | no | 181 | 160 | 0.8839779005524862 |
dense_matmul_cyclic_32x64x48 | cyclic | no | 32 | 64 | 48 | no | 248 | 224 | 0.9032258064516129 |
dense_matmul_cyclic_32x64x64 | cyclic | no | 32 | 64 | 64 | no | 324 | 288 | 0.8888888888888888 |
dense_matmul_cyclic_48x32x32 | cyclic | no | 48 | 32 | 32 | yes | 137 | 120 | 0.8759124087591241 |
dense_matmul_cyclic_48x32x48 | cyclic | no | 48 | 32 | 48 | yes | 190 | 168 | 0.8842105263157894 |
dense_matmul_cyclic_48x32x64 | cyclic | no | 48 | 32 | 64 | no | 244 | 216 | 0.8852459016393442 |
dense_matmul_cyclic_48x48x32 | cyclic | no | 48 | 48 | 32 | yes | 203 | 180 | 0.8866995073891626 |
dense_matmul_cyclic_48x48x48 | cyclic | no | 48 | 48 | 48 | yes | 280 | 252 | 0.9 |
dense_matmul_cyclic_48x48x64 | cyclic | no | 48 | 48 | 64 | no | 364 | 324 | 0.8901098901098901 |
dense_matmul_cyclic_48x64x32 | cyclic | no | 48 | 64 | 32 | no | 269 | 240 | 0.8921933085501859 |
dense_matmul_cyclic_48x64x48 | cyclic | no | 48 | 64 | 48 | no | 370 | 336 | 0.9081081081081082 |
dense_matmul_cyclic_48x64x64 | cyclic | no | 48 | 64 | 64 | no | 484 | 432 | 0.8925619834710744 |
dense_matmul_cyclic_64x32x32 | cyclic | no | 64 | 32 | 32 | no | 181 | 160 | 0.8839779005524862 |
dense_matmul_cyclic_64x32x48 | cyclic | no | 64 | 32 | 48 | no | 249 | 224 | 0.8995983935742972 |
dense_matmul_cyclic_64x32x64 | cyclic | no | 64 | 32 | 64 | no | 324 | 288 | 0.8888888888888888 |
dense_matmul_cyclic_64x48x32 | cyclic | no | 64 | 48 | 32 | no | 269 | 240 | 0.8921933085501859 |
dense_matmul_cyclic_64x48x48 | cyclic | no | 64 | 48 | 48 | no | 369 | 336 | 0.9105691056910569 |
dense_matmul_cyclic_64x48x64 | cyclic | no | 64 | 48 | 64 | no | 484 | 432 | 0.8925619834710744 |
dense_matmul_cyclic_64x64x32 | cyclic | no | 64 | 64 | 32 | no | 357 | 320 | 0.896358543417367 |
dense_matmul_cyclic_64x64x48 | cyclic | no | 64 | 64 | 48 | no | 489 | 448 | 0.9161554192229039 |
dense_matmul_cyclic_64x64x64 | cyclic | no | 64 | 64 | 64 | no | 644 | 576 | 0.8944099378881988 |
average | 0.8901439584229051 |