This test was run at 31/03/2025 11:24:29
Results for a cyclic layout with add C
benchmark | layout | add C | M | N | K | plots | cycles | ideal | utilization |
---|---|---|---|---|---|---|---|---|---|
dense_gemm_cyclic_32x32x32 | cyclic | yes | 32 | 32 | 32 | yes | 116 | 80 | 0.6896551724137931 |
dense_gemm_cyclic_32x32x48 | cyclic | yes | 32 | 32 | 48 | yes | 157 | 112 | 0.7133757961783439 |
dense_gemm_cyclic_32x32x64 | cyclic | yes | 32 | 32 | 64 | no | 183 | 144 | 0.7868852459016393 |
dense_gemm_cyclic_32x48x32 | cyclic | yes | 32 | 48 | 32 | yes | 172 | 120 | 0.6976744186046512 |
dense_gemm_cyclic_32x48x48 | cyclic | yes | 32 | 48 | 48 | yes | 232 | 168 | 0.7241379310344828 |
dense_gemm_cyclic_32x48x64 | cyclic | yes | 32 | 48 | 64 | no | 271 | 216 | 0.7970479704797048 |
dense_gemm_cyclic_32x64x32 | cyclic | yes | 32 | 64 | 32 | no | 228 | 160 | 0.7017543859649122 |
dense_gemm_cyclic_32x64x48 | cyclic | yes | 32 | 64 | 48 | no | 307 | 224 | 0.7296416938110749 |
dense_gemm_cyclic_32x64x64 | cyclic | yes | 32 | 64 | 64 | no | 359 | 288 | 0.8022284122562674 |
dense_gemm_cyclic_48x32x32 | cyclic | yes | 48 | 32 | 32 | yes | 172 | 120 | 0.6976744186046512 |
dense_gemm_cyclic_48x32x48 | cyclic | yes | 48 | 32 | 48 | yes | 232 | 168 | 0.7241379310344828 |
dense_gemm_cyclic_48x32x64 | cyclic | yes | 48 | 32 | 64 | no | 271 | 216 | 0.7970479704797048 |
dense_gemm_cyclic_48x48x32 | cyclic | yes | 48 | 48 | 32 | yes | 256 | 180 | 0.703125 |
dense_gemm_cyclic_48x48x48 | cyclic | yes | 48 | 48 | 48 | yes | 345 | 252 | 0.7304347826086957 |
dense_gemm_cyclic_48x48x64 | cyclic | yes | 48 | 48 | 64 | no | 403 | 324 | 0.8039702233250621 |
dense_gemm_cyclic_48x64x32 | cyclic | yes | 48 | 64 | 32 | no | 340 | 240 | 0.7058823529411765 |
dense_gemm_cyclic_48x64x48 | cyclic | yes | 48 | 64 | 48 | no | 458 | 336 | 0.7336244541484717 |
dense_gemm_cyclic_48x64x64 | cyclic | yes | 48 | 64 | 64 | no | 535 | 432 | 0.8074766355140187 |
dense_gemm_cyclic_64x32x32 | cyclic | yes | 64 | 32 | 32 | no | 228 | 160 | 0.7017543859649122 |
dense_gemm_cyclic_64x32x48 | cyclic | yes | 64 | 32 | 48 | no | 308 | 224 | 0.7272727272727273 |
dense_gemm_cyclic_64x32x64 | cyclic | yes | 64 | 32 | 64 | no | 359 | 288 | 0.8022284122562674 |
dense_gemm_cyclic_64x48x32 | cyclic | yes | 64 | 48 | 32 | no | 340 | 240 | 0.7058823529411765 |
dense_gemm_cyclic_64x48x48 | cyclic | yes | 64 | 48 | 48 | no | 459 | 336 | 0.7320261437908496 |
dense_gemm_cyclic_64x48x64 | cyclic | yes | 64 | 48 | 64 | no | 535 | 432 | 0.8074766355140187 |
dense_gemm_cyclic_64x64x32 | cyclic | yes | 64 | 64 | 32 | no | 452 | 320 | 0.7079646017699115 |
dense_gemm_cyclic_64x64x48 | cyclic | yes | 64 | 64 | 48 | no | 610 | 448 | 0.7344262295081967 |
dense_gemm_cyclic_64x64x64 | cyclic | yes | 64 | 64 | 64 | no | 711 | 576 | 0.810126582278481 |
average | 0.7435160320962104 |
Results for a cyclic layout
benchmark | layout | add C | M | N | K | plots | cycles | ideal | utilization |
---|---|---|---|---|---|---|---|---|---|
dense_matmul_cyclic_32x32x32 | cyclic | no | 32 | 32 | 32 | yes | 111 | 80 | 0.7207207207207207 |
dense_matmul_cyclic_32x32x48 | cyclic | no | 32 | 32 | 48 | yes | 146 | 112 | 0.7671232876712328 |
dense_matmul_cyclic_32x32x64 | cyclic | no | 32 | 32 | 64 | no | 182 | 144 | 0.7912087912087912 |
dense_matmul_cyclic_32x48x32 | cyclic | no | 32 | 48 | 32 | yes | 163 | 120 | 0.7361963190184049 |
dense_matmul_cyclic_32x48x48 | cyclic | no | 32 | 48 | 48 | yes | 214 | 168 | 0.7850467289719626 |
dense_matmul_cyclic_32x48x64 | cyclic | no | 32 | 48 | 64 | no | 270 | 216 | 0.8 |
dense_matmul_cyclic_32x64x32 | cyclic | no | 32 | 64 | 32 | no | 215 | 160 | 0.7441860465116279 |
dense_matmul_cyclic_32x64x48 | cyclic | no | 32 | 64 | 48 | no | 282 | 224 | 0.7943262411347518 |
dense_matmul_cyclic_32x64x64 | cyclic | no | 32 | 64 | 64 | no | 358 | 288 | 0.8044692737430168 |
dense_matmul_cyclic_48x32x32 | cyclic | no | 48 | 32 | 32 | yes | 163 | 120 | 0.7361963190184049 |
dense_matmul_cyclic_48x32x48 | cyclic | no | 48 | 32 | 48 | yes | 214 | 168 | 0.7850467289719626 |
dense_matmul_cyclic_48x32x64 | cyclic | no | 48 | 32 | 64 | no | 270 | 216 | 0.8 |
dense_matmul_cyclic_48x48x32 | cyclic | no | 48 | 48 | 32 | yes | 241 | 180 | 0.7468879668049793 |
dense_matmul_cyclic_48x48x48 | cyclic | no | 48 | 48 | 48 | yes | 316 | 252 | 0.7974683544303798 |
dense_matmul_cyclic_48x48x64 | cyclic | no | 48 | 48 | 64 | no | 402 | 324 | 0.8059701492537313 |
dense_matmul_cyclic_48x64x32 | cyclic | no | 48 | 64 | 32 | no | 319 | 240 | 0.7523510971786834 |
dense_matmul_cyclic_48x64x48 | cyclic | no | 48 | 64 | 48 | no | 418 | 336 | 0.8038277511961722 |
dense_matmul_cyclic_48x64x64 | cyclic | no | 48 | 64 | 64 | no | 534 | 432 | 0.8089887640449438 |
dense_matmul_cyclic_64x32x32 | cyclic | no | 64 | 32 | 32 | no | 215 | 160 | 0.7441860465116279 |
dense_matmul_cyclic_64x32x48 | cyclic | no | 64 | 32 | 48 | no | 282 | 224 | 0.7943262411347518 |
dense_matmul_cyclic_64x32x64 | cyclic | no | 64 | 32 | 64 | no | 358 | 288 | 0.8044692737430168 |
dense_matmul_cyclic_64x48x32 | cyclic | no | 64 | 48 | 32 | no | 319 | 240 | 0.7523510971786834 |
dense_matmul_cyclic_64x48x48 | cyclic | no | 64 | 48 | 48 | no | 418 | 336 | 0.8038277511961722 |
dense_matmul_cyclic_64x48x64 | cyclic | no | 64 | 48 | 64 | no | 534 | 432 | 0.8089887640449438 |
dense_matmul_cyclic_64x64x32 | cyclic | no | 64 | 64 | 32 | no | 423 | 320 | 0.7565011820330969 |
dense_matmul_cyclic_64x64x48 | cyclic | no | 64 | 64 | 48 | no | 554 | 448 | 0.8086642599277978 |
dense_matmul_cyclic_64x64x64 | cyclic | no | 64 | 64 | 64 | no | 710 | 576 | 0.8112676056338028 |
average | 0.7801702504179134 |