Skip to content

CI run 1fe4f77

  • Run: link
  • Time: Denver 2025-08-28 09:04:50 MDT • Brussels 2025-08-28 17:04:50 CEST

GEMM Deployment Summary

  • Workflow: AIE Deployment Gemm
  • Commit: 1fe4f778eba482b55e090cd28e7a918bc16d993c
  • Runner: venus
  • Run time: Denver 2025-08-28 09:04:31 MDT • Brussels 2025-08-28 17:04:31 CEST
  • Run: #69; Attempt 1
HW M K N Rows Cols Status Note
single_col 128 128 128 2 1 🐬 success
single_col 128 128 256 2 1 🐬 success
single_col 128 128 64 2 1 🐬 success
single_col 128 256 128 2 1 🐬 success
single_col 128 256 256 2 1 🐬 success
single_col 128 256 64 2 1 🐬 success
single_col 128 64 128 2 1 🐬 success
single_col 128 64 256 2 1 🐬 success
single_col 128 64 64 2 1 🐬 success
single_col 256 128 128 2 1 🐬 success
single_col 256 128 256 2 1 🐬 success
single_col 256 128 64 2 1 🐬 success
single_col 256 256 128 2 1 🐬 success
single_col 256 256 256 2 1 🐬 success
single_col 256 256 64 2 1 🐬 success
single_col 256 64 128 2 1 🐬 success
single_col 256 64 256 2 1 🐬 success
single_col 256 64 64 2 1 🐬 success
single_col 64 128 128 2 1 🐬 success
single_col 64 128 256 2 1 🐬 success
single_col 64 128 64 2 1 🐬 success
single_col 64 256 128 2 1 🐬 success
single_col 64 256 256 2 1 🐬 success
single_col 64 256 64 2 1 🐬 success
single_col 64 64 128 2 1 🐬 success
single_col 64 64 256 2 1 🐬 success
single_col 64 64 64 2 1 🐬 success
single_core 128 128 128 1 1 🐬 success
single_core 128 128 256 1 1 🐬 success
single_core 128 128 64 1 1 🐬 success
single_core 128 256 128 1 1 🐬 success
single_core 128 256 256 1 1 🐬 success
single_core 128 256 64 1 1 🐬 success
single_core 128 64 128 1 1 🐬 success
single_core 128 64 256 1 1 🐬 success
single_core 128 64 64 1 1 🐬 success
single_core 256 128 128 1 1 🐬 success
single_core 256 128 256 1 1 🐬 success
single_core 256 128 64 1 1 🐬 success
single_core 256 256 128 1 1 🐬 success
single_core 256 256 256 1 1 🐬 success
single_core 256 256 64 1 1 🐬 success
single_core 256 64 128 1 1 🐬 success
single_core 256 64 256 1 1 🐬 success
single_core 256 64 64 1 1 🐬 success
single_core 64 128 128 1 1 🐬 success
single_core 64 128 256 1 1 🐬 success
single_core 64 128 64 1 1 🐬 success
single_core 64 256 128 1 1 🐬 success
single_core 64 256 256 1 1 🐬 success
single_core 64 256 64 1 1 🐬 success
single_core 64 64 128 1 1 🐬 success
single_core 64 64 256 1 1 🐬 success
single_core 64 64 64 1 1 🐬 success
whole_array 128 128 128 2 4 🐬 success
whole_array 128 128 256 2 4 ❌ failed Peano not added to PATH causing potential conflicts.
whole_array 128 128 64 2 4 ❌ failed missing status.ok
whole_array 128 256 128 2 4 🐬 success
whole_array 128 256 256 2 4 ❌ failed Peano not added to PATH causing execution issues.
whole_array 128 256 64 2 4 ❌ failed missing status.ok
whole_array 128 64 128 2 4 🐬 success
whole_array 128 64 256 2 4 ❌ failed Peano not added to PATH causing potential conflicts.
whole_array 128 64 64 2 4 ❌ failed missing status.ok
whole_array 256 128 128 2 4 ❌ failed Peano not added to PATH causing potential conflicts.
whole_array 256 128 256 2 4 ❌ failed Peano not added to PATH causing potential conflicts.
whole_array 256 128 64 2 4 ❌ failed missing status.ok
whole_array 256 256 128 2 4 ❌ failed Peano not added to PATH causing potential conflicts.
whole_array 256 256 256 2 4 ❌ failed Peano not added to PATH causing potential conflicts.
whole_array 256 256 64 2 4 ❌ failed missing status.ok
whole_array 256 64 128 2 4 ❌ failed Peano not added to PATH causing potential conflicts.
whole_array 256 64 256 2 4 ❌ failed Peano not added to PATH causing execution issues.
whole_array 256 64 64 2 4 ❌ failed missing status.ok
whole_array 64 128 128 2 4 🐬 success
whole_array 64 128 256 2 4 🐬 success
whole_array 64 128 64 2 4 ❌ failed missing status.ok
whole_array 64 256 128 2 4 🐬 success
whole_array 64 256 256 2 4 🐬 success
whole_array 64 256 64 2 4 ❌ failed missing status.ok
whole_array 64 64 128 2 4 🐬 success
whole_array 64 64 256 2 4 🐬 success
whole_array 64 64 64 2 4 ❌ failed missing status.ok

Totals: 🐬 63 • ❌ 18 • All: 81

[single_col] M=128 K=128 N=128 R=2 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 32 | 52,733 | 1,070.00 | 30.62 | 47.85 | 19.88 | 31.07 | | tile3,1 | 32 | 52,734 | 1,070.00 | 30.62 | 47.85 | 19.88 | 31.07 |
[single_col] M=128 K=128 N=256 R=2 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 64 | 109,938 | 1,070.00 | 30.62 | 47.85 | 19.08 | 29.81 | | tile3,1 | 64 | 109,939 | 1,070.00 | 30.62 | 47.85 | 19.08 | 29.81 |
[single_col] M=128 K=128 N=64 R=2 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile3,1 | 16 | 24,176 | 1,070.00 | 30.62 | 47.85 | 21.69 | 33.88 | | tile2,1 | 16 | 24,176 | 1,070.00 | 30.62 | 47.85 | 21.69 | 33.88 |
[single_col] M=128 K=256 N=128 R=2 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 64 | 113,286 | 1,058.00 | 30.97 | 48.39 | 18.51 | 28.93 | | tile3,1 | 64 | 113,286 | 1,058.00 | 30.97 | 48.39 | 18.51 | 28.93 |
[single_col] M=128 K=256 N=256 R=2 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile3,1 | 128 | 235,052 | 1,058.00 | 30.97 | 48.39 | 17.84 | 27.88 | | tile2,1 | 128 | 235,052 | 1,058.00 | 30.97 | 48.39 | 17.84 | 27.88 |
[single_col] M=128 K=256 N=64 R=2 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 32 | 52,411 | 1,058.00 | 30.97 | 48.39 | 20.01 | 31.26 | | tile3,1 | 32 | 52,412 | 1,058.00 | 30.97 | 48.39 | 20.01 | 31.26 |
[single_col] M=128 K=64 N=128 R=2 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 16 | 26,413 | 1,078.00 | 30.40 | 47.50 | 19.85 | 31.02 | | tile3,1 | 16 | 26,924 | 1,078.00 | 30.40 | 47.50 | 19.47 | 30.43 |
[single_col] M=128 K=64 N=256 R=2 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 32 | 54,591 | 1,078.00 | 30.40 | 47.50 | 19.21 | 30.01 | | tile3,1 | 32 | 55,110 | 1,078.00 | 30.40 | 47.50 | 19.03 | 29.73 |
[single_col] M=128 K=64 N=64 R=2 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile3,1 | 8 | 12,330 | 1,078.00 | 30.40 | 47.50 | 21.26 | 33.22 | | tile2,1 | 8 | 12,330 | 1,078.00 | 30.40 | 47.50 | 21.26 | 33.22 |
[single_col] M=256 K=128 N=128 R=2 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 64 | 105,638 | 1,070.00 | 30.62 | 47.85 | 19.85 | 31.02 | | tile3,1 | 64 | 105,638 | 1,070.00 | 30.62 | 47.85 | 19.85 | 31.02 |
[single_col] M=256 K=128 N=256 R=2 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 127 | 214,346 | 1,072.30 | 30.56 | 47.75 | 19.57 | 30.57 | | tile3,1 | 127 | 215,290 | 1,072.83 | 30.54 | 47.72 | 19.48 | 30.44 |
[single_col] M=256 K=128 N=64 R=2 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile3,1 | 32 | 50,629 | 1,070.00 | 30.62 | 47.85 | 20.71 | 32.36 | | tile2,1 | 32 | 50,630 | 1,070.00 | 30.62 | 47.85 | 20.71 | 32.36 |
[single_col] M=256 K=256 N=128 R=2 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile3,1 | 128 | 220,551 | 1,058.00 | 30.97 | 48.39 | 19.02 | 29.71 | | tile2,1 | 128 | 220,551 | 1,058.00 | 30.97 | 48.39 | 19.02 | 29.71 |
[single_col] M=256 K=256 N=256 R=2 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 255 | 448,197 | 1,058.77 | 30.95 | 48.36 | 18.72 | 29.24 | | tile3,1 | 256 | 448,890 | 1,055.26 | 31.05 | 48.52 | 18.69 | 29.20 |
[single_col] M=256 K=256 N=64 R=2 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile3,1 | 64 | 106,037 | 1,058.00 | 30.97 | 48.39 | 19.78 | 30.90 | | tile2,1 | 64 | 106,038 | 1,058.00 | 30.97 | 48.39 | 19.78 | 30.90 |
[single_col] M=256 K=64 N=128 R=2 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile3,1 | 32 | 56,782 | 1,150.00 | 28.49 | 44.52 | 18.47 | 28.85 | | tile2,1 | 32 | 56,782 | 1,150.00 | 28.49 | 44.52 | 18.47 | 28.85 |
[single_col] M=256 K=64 N=256 R=2 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile3,1 | 64 | 115,879 | 1,150.00 | 28.49 | 44.52 | 18.10 | 28.28 | | tile2,1 | 64 | 115,879 | 1,150.00 | 28.49 | 44.52 | 18.10 | 28.28 |
[single_col] M=256 K=64 N=64 R=2 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 16 | 27,233 | 1,150.00 | 28.49 | 44.52 | 19.25 | 30.08 | | tile3,1 | 16 | 27,233 | 1,150.00 | 28.49 | 44.52 | 19.25 | 30.08 |
[single_col] M=64 K=128 N=128 R=2 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 16 | 23,191 | 1,046.00 | 31.33 | 48.95 | 22.61 | 35.32 | | tile3,1 | 16 | 23,706 | 1,046.00 | 31.33 | 48.95 | 22.12 | 34.56 |
[single_col] M=64 K=128 N=256 R=2 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 32 | 50,191 | 1,046.00 | 31.33 | 48.95 | 20.89 | 32.64 | | tile3,1 | 32 | 50,708 | 1,046.00 | 31.33 | 48.95 | 20.68 | 32.31 |
[single_col] M=64 K=128 N=64 R=2 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile3,1 | 8 | 9,685 | 1,046.00 | 31.33 | 48.95 | 27.07 | 42.29 | | tile2,1 | 8 | 9,687 | 1,046.00 | 31.33 | 48.95 | 27.06 | 42.28 |
[single_col] M=64 K=256 N=128 R=2 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile3,1 | 32 | 59,613 | 1,058.00 | 30.97 | 48.39 | 17.59 | 27.48 | | tile2,1 | 32 | 60,130 | 1,058.00 | 30.97 | 48.39 | 17.44 | 27.25 |
[single_col] M=64 K=256 N=256 R=2 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile3,1 | 64 | 127,654 | 1,058.00 | 30.97 | 48.39 | 16.43 | 25.67 | | tile2,1 | 64 | 128,165 | 1,058.00 | 30.97 | 48.39 | 16.36 | 25.57 |
[single_col] M=64 K=256 N=64 R=2 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile3,1 | 16 | 25,585 | 1,058.00 | 30.97 | 48.39 | 20.49 | 32.02 | | tile2,1 | 16 | 26,100 | 1,058.00 | 30.97 | 48.39 | 20.09 | 31.39 |
[single_col] M=64 K=64 N=128 R=2 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 8 | 13,315 | 1,134.00 | 28.90 | 45.15 | 19.69 | 30.76 | | tile3,1 | 8 | 13,828 | 1,134.00 | 28.90 | 45.15 | 18.96 | 29.62 |
[single_col] M=64 K=64 N=256 R=2 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 16 | 28,387 | 1,134.00 | 28.90 | 45.15 | 18.47 | 28.86 | | tile3,1 | 16 | 28,903 | 1,134.00 | 28.90 | 45.15 | 18.14 | 28.34 |
[single_col] M=64 K=64 N=64 R=2 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile3,1 | 4 | 5,781 | 1,134.00 | 28.90 | 45.15 | 22.67 | 35.43 | | tile2,1 | 4 | 5,781 | 1,134.00 | 28.90 | 45.15 | 22.67 | 35.43 |
[single_core] M=128 K=128 N=128 R=1 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 64 | 103,700 | 1,070.00 | 30.62 | 47.85 | 20.22 | 31.60 |
[single_core] M=128 K=128 N=256 R=1 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 128 | 209,746 | 1,070.00 | 30.62 | 47.85 | 20.00 | 31.25 |
[single_core] M=128 K=128 N=64 R=1 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 32 | 50,678 | 1,070.00 | 30.62 | 47.85 | 20.69 | 32.33 |
[single_core] M=128 K=256 N=128 R=1 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 128 | 222,770 | 1,058.00 | 30.97 | 48.39 | 18.83 | 29.42 |
[single_core] M=128 K=256 N=256 R=1 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 256 | 449,911 | 1,058.00 | 30.97 | 48.39 | 18.65 | 29.13 |
[single_core] M=128 K=256 N=64 R=1 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 64 | 109,192 | 1,058.00 | 30.97 | 48.39 | 19.21 | 30.01 |
[single_core] M=128 K=64 N=128 R=1 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 32 | 53,069 | 1,078.00 | 30.40 | 47.50 | 19.76 | 30.87 |
[single_core] M=128 K=64 N=256 R=1 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 64 | 107,388 | 1,078.00 | 30.40 | 47.50 | 19.53 | 30.51 |
[single_core] M=128 K=64 N=64 R=1 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 16 | 25,903 | 1,078.00 | 30.40 | 47.50 | 20.24 | 31.63 |
[single_core] M=256 K=128 N=128 R=1 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 128 | 209,520 | 1,070.00 | 30.62 | 47.85 | 20.02 | 31.28 |
[single_core] M=256 K=128 N=256 R=1 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 256 | 421,374 | 1,070.00 | 30.62 | 47.85 | 19.91 | 31.11 |
[single_core] M=256 K=128 N=64 R=1 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 64 | 103,582 | 1,070.00 | 30.62 | 47.85 | 20.25 | 31.63 |
[single_core] M=256 K=256 N=128 R=1 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 256 | 437,365 | 1,058.00 | 30.97 | 48.39 | 19.18 | 29.97 |
[single_core] M=256 K=256 N=256 R=1 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 512 | 878,968 | 1,057.73 | 30.98 | 48.41 | 19.09 | 29.82 |
[single_core] M=256 K=256 N=64 R=1 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 128 | 216,506 | 1,058.00 | 30.97 | 48.39 | 19.37 | 30.27 |
[single_core] M=256 K=64 N=128 R=1 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 64 | 112,809 | 1,150.00 | 28.49 | 44.52 | 18.59 | 29.05 |
[single_core] M=256 K=64 N=256 R=1 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 128 | 227,165 | 1,346.51 | 24.34 | 38.02 | 18.46 | 28.85 |
[single_core] M=256 K=64 N=64 R=1 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 32 | 55,759 | 1,150.00 | 28.49 | 44.52 | 18.81 | 29.38 |
[single_core] M=64 K=128 N=128 R=1 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 32 | 45,585 | 1,046.00 | 31.33 | 48.95 | 23.00 | 35.94 |
[single_core] M=64 K=128 N=256 R=1 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 64 | 93,455 | 1,046.00 | 31.33 | 48.95 | 22.44 | 35.06 |
[single_core] M=64 K=128 N=64 R=1 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 16 | 21,654 | 1,046.00 | 31.33 | 48.95 | 24.21 | 37.83 |
[single_core] M=64 K=256 N=128 R=1 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 64 | 115,327 | 1,058.00 | 30.97 | 48.39 | 18.18 | 28.41 |
[single_core] M=64 K=256 N=256 R=1 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 128 | 235,018 | 1,058.00 | 30.97 | 48.39 | 17.85 | 27.89 |
[single_core] M=64 K=256 N=64 R=1 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 32 | 55,501 | 1,058.00 | 30.97 | 48.39 | 18.89 | 29.52 |
[single_core] M=64 K=64 N=128 R=1 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 16 | 26,864 | 1,134.00 | 28.90 | 45.15 | 19.52 | 30.49 |
[single_core] M=64 K=64 N=256 R=1 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 32 | 54,972 | 1,134.00 | 28.90 | 45.15 | 19.07 | 29.80 |
[single_core] M=64 K=64 N=64 R=1 C=1 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 8 | 12,808 | 1,134.00 | 28.90 | 45.15 | 20.47 | 31.98 |
[whole_array] M=128 K=128 N=128 R=2 C=4 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile3,1 | 8 | 12,859 | 1,052.00 | 31.15 | 48.67 | 20.39 | 31.85 | | tile2,1 | 7 | 11,789 | 1,125.71 | 29.11 | 45.48 | 19.77 | 30.88 | | tile2,4 | 7 | 12,529 | 1,122.57 | 29.19 | 45.61 | 18.60 | 29.06 | | tile2,2 | 7 | 12,620 | 1,115.00 | 29.39 | 45.92 | 18.46 | 28.85 | | tile3,4 | 7 | 12,784 | 1,158.14 | 28.29 | 44.21 | 18.23 | 28.48 | | tile3,3 | 8 | 14,670 | 1,691.12 | 19.38 | 30.28 | 17.87 | 27.92 | | tile2,3 | 7 | 13,422 | 1,123.71 | 29.16 | 45.56 | 17.36 | 27.13 | | tile3,2 | 8 | 15,502 | 1,881.12 | 17.42 | 27.22 | 16.91 | 26.42 |
[whole_array] M=128 K=256 N=128 R=2 C=4 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile3,3 | 15 | 25,474 | 1,152.00 | 28.44 | 44.44 | 20.58 | 32.16 | | tile2,4 | 15 | 25,599 | 1,131.13 | 28.97 | 45.26 | 20.48 | 32.00 | | tile3,4 | 15 | 25,604 | 1,160.67 | 28.23 | 44.11 | 20.48 | 32.00 | | tile2,3 | 15 | 25,730 | 1,139.87 | 28.75 | 44.92 | 20.38 | 31.84 | | tile2,2 | 15 | 25,740 | 1,140.53 | 28.73 | 44.89 | 20.37 | 31.83 | | tile2,1 | 16 | 27,424 | 1,836.06 | 17.85 | 27.89 | 19.12 | 29.87 | | tile3,1 | 16 | 27,644 | 1,888.69 | 17.35 | 27.11 | 18.97 | 29.63 | | tile3,2 | 16 | 27,963 | 1,920.69 | 17.06 | 26.66 | 18.75 | 29.30 |
[whole_array] M=128 K=64 N=128 R=2 C=4 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 3 | 5,042 | 1,098.67 | 29.83 | 46.60 | 20.80 | 32.49 | | tile3,1 | 4 | 7,217 | 1,529.50 | 21.42 | 33.47 | 18.16 | 28.38 | | tile2,4 | 3 | 6,037 | 1,145.33 | 28.61 | 44.70 | 17.37 | 27.14 | | tile3,4 | 3 | 6,153 | 1,182.00 | 27.72 | 43.32 | 17.04 | 26.63 | | tile2,2 | 3 | 6,979 | 1,092.00 | 30.01 | 46.89 | 15.02 | 23.48 | | tile3,2 | 4 | 9,390 | 1,622.00 | 20.20 | 31.57 | 13.96 | 21.81 | | tile2,3 | 3 | 10,600 | 1,150.00 | 28.49 | 44.52 | 9.89 | 15.46 | | tile3,3 | 3 | 10,622 | 1,154.67 | 28.38 | 44.34 | 9.87 | 15.42 |
[whole_array] M=64 K=128 N=128 R=2 C=4 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile3,1 | 4 | 6,209 | 1,150.00 | 28.49 | 44.52 | 21.11 | 32.98 | | tile2,1 | 4 | 6,212 | 1,150.00 | 28.49 | 44.52 | 21.10 | 32.97 | | tile2,4 | 4 | 7,187 | 1,150.00 | 28.49 | 44.52 | 18.24 | 28.50 | | tile3,4 | 4 | 7,192 | 1,150.00 | 28.49 | 44.52 | 18.22 | 28.48 | | tile3,2 | 4 | 8,210 | 1,150.00 | 28.49 | 44.52 | 15.96 | 24.95 | | tile2,2 | 4 | 8,213 | 1,150.00 | 28.49 | 44.52 | 15.96 | 24.94 | | tile2,3 | 4 | 11,888 | 1,150.00 | 28.49 | 44.52 | 11.03 | 17.23 | | tile3,3 | 4 | 11,894 | 1,150.00 | 28.49 | 44.52 | 11.02 | 17.22 |
[whole_array] M=64 K=128 N=256 R=2 C=4 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile3,3 | 8 | 13,761 | 1,150.00 | 28.49 | 44.52 | 19.05 | 29.77 | | tile3,4 | 8 | 13,761 | 1,150.00 | 28.49 | 44.52 | 19.05 | 29.77 | | tile2,4 | 8 | 13,761 | 1,150.00 | 28.49 | 44.52 | 19.05 | 29.77 | | tile2,1 | 8 | 13,762 | 1,150.00 | 28.49 | 44.52 | 19.05 | 29.76 | | tile3,1 | 8 | 13,763 | 1,150.00 | 28.49 | 44.52 | 19.05 | 29.76 | | tile3,2 | 8 | 13,818 | 1,150.00 | 28.49 | 44.52 | 18.97 | 29.64 | | tile2,2 | 8 | 13,818 | 1,150.00 | 28.49 | 44.52 | 18.97 | 29.64 | | tile2,3 | 8 | 14,047 | 1,150.00 | 28.49 | 44.52 | 18.66 | 29.16 |
[whole_array] M=64 K=256 N=128 R=2 C=4 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile3,1 | 8 | 12,952 | 1,150.00 | 28.49 | 44.52 | 20.24 | 31.62 | | tile2,1 | 8 | 12,956 | 1,150.00 | 28.49 | 44.52 | 20.23 | 31.61 | | tile2,4 | 8 | 13,690 | 1,150.00 | 28.49 | 44.52 | 19.15 | 29.92 | | tile3,4 | 8 | 13,694 | 1,150.00 | 28.49 | 44.52 | 19.14 | 29.91 | | tile3,2 | 8 | 13,996 | 1,150.00 | 28.49 | 44.52 | 18.73 | 29.27 | | tile2,2 | 8 | 14,000 | 1,150.00 | 28.49 | 44.52 | 18.72 | 29.26 | | tile2,3 | 8 | 15,131 | 1,150.00 | 28.49 | 44.52 | 17.32 | 27.07 | | tile3,3 | 8 | 15,136 | 1,150.00 | 28.49 | 44.52 | 17.32 | 27.06 |
[whole_array] M=64 K=256 N=256 R=2 C=4 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 16 | 29,064 | 1,131.62 | 28.96 | 45.24 | 18.04 | 28.19 | | tile3,1 | 16 | 29,355 | 1,150.00 | 28.49 | 44.52 | 17.86 | 27.91 | | tile2,2 | 16 | 29,394 | 1,141.44 | 28.71 | 44.86 | 17.84 | 27.87 | | tile3,2 | 16 | 29,527 | 1,150.00 | 28.49 | 44.52 | 17.76 | 27.74 | | tile2,4 | 16 | 30,075 | 1,150.00 | 28.49 | 44.52 | 17.43 | 27.24 | | tile3,4 | 16 | 30,079 | 1,150.00 | 28.49 | 44.52 | 17.43 | 27.23 | | tile2,3 | 16 | 30,222 | 1,134.19 | 28.89 | 45.14 | 17.35 | 27.11 | | tile3,3 | 16 | 30,478 | 1,150.00 | 28.49 | 44.52 | 17.20 | 26.88 |
[whole_array] M=64 K=64 N=128 R=2 C=4 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile3,4 | 2 | 2,826 | 1,150.00 | 28.49 | 44.52 | 23.19 | 36.23 | | tile3,3 | 2 | 2,826 | 1,150.00 | 28.49 | 44.52 | 23.19 | 36.23 | | tile3,1 | 2 | 2,826 | 1,150.00 | 28.49 | 44.52 | 23.19 | 36.23 | | tile2,4 | 2 | 2,826 | 1,150.00 | 28.49 | 44.52 | 23.19 | 36.23 | | tile2,2 | 2 | 2,826 | 1,150.00 | 28.49 | 44.52 | 23.19 | 36.23 | | tile2,3 | 2 | 2,826 | 1,150.00 | 28.49 | 44.52 | 23.19 | 36.23 | | tile2,1 | 2 | 2,826 | 1,150.00 | 28.49 | 44.52 | 23.19 | 36.23 | | tile3,2 | 2 | 2,827 | 1,150.00 | 28.49 | 44.52 | 23.18 | 36.22 |
[whole_array] M=64 K=64 N=256 R=2 C=4 | Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile3,1 | 4 | 6,869 | 1,150.00 | 28.49 | 44.52 | 19.08 | 29.82 | | tile3,2 | 4 | 6,869 | 1,150.00 | 28.49 | 44.52 | 19.08 | 29.82 | | tile2,4 | 4 | 6,869 | 1,150.00 | 28.49 | 44.52 | 19.08 | 29.82 | | tile2,3 | 4 | 6,869 | 1,150.00 | 28.49 | 44.52 | 19.08 | 29.82 | | tile2,1 | 4 | 6,869 | 1,150.00 | 28.49 | 44.52 | 19.08 | 29.82 | | tile3,3 | 4 | 6,870 | 1,150.00 | 28.49 | 44.52 | 19.08 | 29.81 | | tile3,4 | 4 | 6,870 | 1,150.00 | 28.49 | 44.52 | 19.08 | 29.81 | | tile2,2 | 4 | 6,870 | 1,150.00 | 28.49 | 44.52 | 19.08 | 29.81 |