CI run 3ed0d2f
- Run: link
- Time: Denver
2025-08-26 02:40:27 MDT
• Brussels 2025-08-26 10:40:27 CEST
GEMM Deployment Summary
- Workflow:
AIE Deployment Gemm
- Commit:
3ed0d2f8cef99eb36cd88893a0bd602208b9b827
- Runner:
venus
- Run time: Denver
2025-08-26 02:40:08 MDT
• Brussels 2025-08-26 10:40:08 CEST
- Run: #38; Attempt 1
HW | M | K | N | Status | Note |
single_col | 128 | 128 | 128 | ✅ success | |
single_col | 128 | 128 | 256 | ❌ failed | missing status.ok |
single_col | 128 | 128 | 64 | ✅ success | |
single_col | 128 | 256 | 128 | ❌ failed | missing status.ok |
single_col | 128 | 256 | 256 | ❌ failed | missing status.ok |
single_col | 128 | 256 | 64 | ❌ failed | missing status.ok |
single_col | 128 | 64 | 128 | ✅ success | |
single_col | 128 | 64 | 256 | ❌ failed | missing status.ok |
single_col | 128 | 64 | 64 | ✅ success | |
single_col | 256 | 128 | 128 | ✅ success | |
single_col | 256 | 128 | 256 | ❌ failed | missing status.ok |
single_col | 256 | 128 | 64 | ✅ success | |
single_col | 256 | 256 | 128 | ❌ failed | missing status.ok |
single_col | 256 | 256 | 256 | ❌ failed | missing status.ok |
single_col | 256 | 256 | 64 | ❌ failed | missing status.ok |
single_col | 256 | 64 | 128 | ✅ success | |
single_col | 256 | 64 | 256 | ❌ failed | missing status.ok |
single_col | 256 | 64 | 64 | ✅ success | |
single_col | 64 | 128 | 128 | ✅ success | |
single_col | 64 | 128 | 256 | ❌ failed | missing status.ok |
single_col | 64 | 128 | 64 | ✅ success | |
single_col | 64 | 256 | 128 | ❌ failed | missing status.ok |
single_col | 64 | 256 | 256 | ❌ failed | missing status.ok |
single_col | 64 | 256 | 64 | ❌ failed | missing status.ok |
single_col | 64 | 64 | 128 | ✅ success | |
single_col | 64 | 64 | 256 | ❌ failed | missing status.ok |
single_col | 64 | 64 | 64 | ✅ success | |
single_core | 128 | 128 | 128 | ❌ failed | missing status.ok |
single_core | 128 | 128 | 256 | ✅ success | |
single_core | 128 | 128 | 64 | ✅ success | |
single_core | 128 | 256 | 128 | ✅ success | |
single_core | 128 | 256 | 256 | ✅ success | |
single_core | 128 | 256 | 64 | ✅ success | |
single_core | 128 | 64 | 128 | ✅ success | |
single_core | 128 | 64 | 256 | ✅ success | |
single_core | 128 | 64 | 64 | ✅ success | |
single_core | 256 | 128 | 128 | ❌ failed | missing status.ok |
single_core | 256 | 128 | 256 | ✅ success | |
single_core | 256 | 128 | 64 | ✅ success | |
single_core | 256 | 256 | 128 | ✅ success | |
single_core | 256 | 256 | 256 | ✅ success | |
single_core | 256 | 256 | 64 | ✅ success | |
single_core | 256 | 64 | 128 | ✅ success | |
single_core | 256 | 64 | 256 | ✅ success | |
single_core | 256 | 64 | 64 | ✅ success | |
single_core | 64 | 128 | 128 | ✅ success | |
single_core | 64 | 128 | 256 | ✅ success | |
single_core | 64 | 128 | 64 | ✅ success | |
single_core | 64 | 256 | 128 | ✅ success | |
single_core | 64 | 256 | 256 | ✅ success | |
single_core | 64 | 256 | 64 | ✅ success | |
single_core | 64 | 64 | 128 | ✅ success | |
single_core | 64 | 64 | 256 | ✅ success | |
single_core | 64 | 64 | 64 | ✅ success | |
whole_array | 128 | 128 | 128 | ✅ success | |
whole_array | 128 | 128 | 256 | ❌ failed | missing status.ok |
whole_array | 128 | 128 | 64 | ✅ success | |
whole_array | 128 | 256 | 128 | ✅ success | |
whole_array | 128 | 256 | 256 | ❌ failed | missing status.ok |
whole_array | 128 | 256 | 64 | ✅ success | |
whole_array | 128 | 64 | 128 | ✅ success | |
whole_array | 128 | 64 | 256 | ❌ failed | missing status.ok |
whole_array | 128 | 64 | 64 | ✅ success | |
whole_array | 256 | 128 | 128 | ✅ success | |
whole_array | 256 | 128 | 256 | ❌ failed | missing status.ok |
whole_array | 256 | 128 | 64 | ✅ success | |
whole_array | 256 | 256 | 128 | ✅ success | |
whole_array | 256 | 256 | 256 | ❌ failed | missing status.ok |
whole_array | 256 | 256 | 64 | ✅ success | |
whole_array | 256 | 64 | 128 | ✅ success | |
whole_array | 256 | 64 | 256 | ❌ failed | missing status.ok |
whole_array | 256 | 64 | 64 | ✅ success | |
whole_array | 64 | 128 | 128 | ✅ success | |
whole_array | 64 | 128 | 256 | ✅ success | |
whole_array | 64 | 128 | 64 | ✅ success | |
whole_array | 64 | 256 | 128 | ✅ success | |
whole_array | 64 | 256 | 256 | ✅ success | |
whole_array | 64 | 256 | 64 | ✅ success | |
whole_array | 64 | 64 | 128 | ✅ success | |
whole_array | 64 | 64 | 256 | ✅ success | |
whole_array | 64 | 64 | 64 | ✅ success | |
Totals: ✅ 58
• ❌ 23
• All: 81
Details for Successful Runs
[single_col] M=128 K=128 N=128
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 32 | 61,029 | 1,150.00 | 28.49 | 44.52 | 17.18 | 26.85 | | tile3,1 | 32 | 61,029 | 1,150.00 | 28.49 | 44.52 | 17.18 | 26.85 | Details for Successful Runs
[single_col] M=128 K=128 N=64
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 16 | 28,363 | 1,150.00 | 28.49 | 44.52 | 18.48 | 28.88 | | tile3,1 | 16 | 28,364 | 1,150.00 | 28.49 | 44.52 | 18.48 | 28.88 | Details for Successful Runs
[single_col] M=128 K=64 N=128
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile3,1 | 16 | 31,187 | 1,150.00 | 28.49 | 44.52 | 16.81 | 26.27 | | tile2,1 | 16 | 31,187 | 1,150.00 | 28.49 | 44.52 | 16.81 | 26.27 | Details for Successful Runs
[single_col] M=128 K=64 N=64
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 8 | 12,940 | 1,150.00 | 28.49 | 44.52 | 20.26 | 31.65 | | tile3,1 | 8 | 12,940 | 1,150.00 | 28.49 | 44.52 | 20.26 | 31.65 | Details for Successful Runs
[single_col] M=256 K=128 N=128
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile3,1 | 64 | 120,208 | 1,150.00 | 28.49 | 44.52 | 17.45 | 27.26 | | tile2,1 | 64 | 120,208 | 1,150.00 | 28.49 | 44.52 | 17.45 | 27.26 | Details for Successful Runs
[single_col] M=256 K=128 N=64
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 32 | 57,950 | 1,150.00 | 28.49 | 44.52 | 18.09 | 28.27 | | tile3,1 | 32 | 57,951 | 1,150.00 | 28.49 | 44.52 | 18.09 | 28.27 | Details for Successful Runs
[single_col] M=256 K=64 N=128
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 32 | 59,503 | 1,150.00 | 28.49 | 44.52 | 17.62 | 27.53 | | tile3,1 | 32 | 59,503 | 1,150.00 | 28.49 | 44.52 | 17.62 | 27.53 | Details for Successful Runs
[single_col] M=256 K=64 N=64
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 16 | 27,104 | 1,150.00 | 28.49 | 44.52 | 19.34 | 30.22 | | tile3,1 | 16 | 27,105 | 1,150.00 | 28.49 | 44.52 | 19.34 | 30.22 | Details for Successful Runs
[single_col] M=64 K=128 N=128
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile3,1 | 16 | 31,450 | 1,150.00 | 28.49 | 44.52 | 16.67 | 26.05 | | tile2,1 | 16 | 31,965 | 1,150.00 | 28.49 | 44.52 | 16.40 | 25.63 | Details for Successful Runs
[single_col] M=64 K=128 N=64
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile3,1 | 8 | 13,574 | 1,150.00 | 28.49 | 44.52 | 19.31 | 30.18 | | tile2,1 | 8 | 14,089 | 1,150.00 | 28.49 | 44.52 | 18.61 | 29.07 | Details for Successful Runs
[single_col] M=64 K=64 N=128
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 8 | 13,293 | 1,134.00 | 28.90 | 45.15 | 19.72 | 30.81 | | tile3,1 | 8 | 13,807 | 1,134.00 | 28.90 | 45.15 | 18.99 | 29.67 | Details for Successful Runs
[single_col] M=64 K=64 N=64
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 4 | 5,781 | 1,134.00 | 28.90 | 45.15 | 22.67 | 35.43 | | tile3,1 | 4 | 5,845 | 1,150.00 | 28.49 | 44.52 | 22.42 | 35.04 | Details for Successful Runs
[single_core] M=128 K=128 N=256
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 128 | 242,699 | 1,150.00 | 28.49 | 44.52 | 17.28 | 27.00 | Details for Successful Runs
[single_core] M=128 K=128 N=64
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 32 | 58,982 | 1,150.00 | 28.49 | 44.52 | 17.78 | 27.78 | Details for Successful Runs
[single_core] M=128 K=256 N=128
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 128 | 232,308 | 1,150.00 | 28.49 | 44.52 | 18.05 | 28.21 | Details for Successful Runs
[single_core] M=128 K=256 N=256
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 256 | 468,019 | 1,150.00 | 28.49 | 44.52 | 17.92 | 28.01 | Details for Successful Runs
[single_core] M=128 K=256 N=64
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 64 | 114,420 | 1,150.00 | 28.49 | 44.52 | 18.33 | 28.64 | Details for Successful Runs
[single_core] M=128 K=64 N=128
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 32 | 55,463 | 1,150.00 | 28.49 | 44.52 | 18.91 | 29.54 | Details for Successful Runs
[single_core] M=128 K=64 N=256
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 64 | 112,177 | 1,150.00 | 28.49 | 44.52 | 18.70 | 29.21 | Details for Successful Runs
[single_core] M=128 K=64 N=64
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 16 | 27,115 | 1,150.00 | 28.49 | 44.52 | 19.34 | 30.21 | Details for Successful Runs
[single_core] M=256 K=128 N=256
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 256 | 479,207 | 1,148.99 | 28.52 | 44.56 | 17.51 | 27.35 | Details for Successful Runs
[single_core] M=256 K=128 N=64
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 64 | 118,158 | 1,150.00 | 28.49 | 44.52 | 17.75 | 27.73 | Details for Successful Runs
[single_core] M=256 K=256 N=128
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 256 | 459,151 | 1,150.00 | 28.49 | 44.52 | 18.27 | 28.55 | Details for Successful Runs
[single_core] M=256 K=256 N=256
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 511 | 924,987 | 1,151.18 | 28.46 | 44.48 | 18.14 | 28.34 | Details for Successful Runs
[single_core] M=256 K=256 N=64
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 128 | 227,853 | 1,150.00 | 28.49 | 44.52 | 18.41 | 28.76 | Details for Successful Runs
[single_core] M=256 K=64 N=128
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 64 | 112,121 | 1,150.00 | 28.49 | 44.52 | 18.70 | 29.23 | Details for Successful Runs
[single_core] M=256 K=64 N=256
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 128 | 225,899 | 1,346.84 | 24.33 | 38.01 | 18.57 | 29.01 | Details for Successful Runs
[single_core] M=256 K=64 N=64
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 32 | 55,443 | 1,150.00 | 28.49 | 44.52 | 18.91 | 29.55 | Details for Successful Runs
[single_core] M=64 K=128 N=128
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 32 | 61,060 | 1,150.00 | 28.49 | 44.52 | 17.17 | 26.83 | Details for Successful Runs
[single_core] M=64 K=128 N=256
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 64 | 124,379 | 1,150.00 | 28.49 | 44.52 | 16.86 | 26.35 | Details for Successful Runs
[single_core] M=64 K=128 N=64
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 16 | 29,402 | 1,150.00 | 28.49 | 44.52 | 17.83 | 27.86 | Details for Successful Runs
[single_core] M=64 K=256 N=128
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 64 | 118,856 | 1,150.00 | 28.49 | 44.52 | 17.64 | 27.57 | Details for Successful Runs
[single_core] M=64 K=256 N=256
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 128 | 241,159 | 1,150.00 | 28.49 | 44.52 | 17.39 | 27.18 | Details for Successful Runs
[single_core] M=64 K=256 N=64
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 32 | 57,706 | 1,150.00 | 28.49 | 44.52 | 18.17 | 28.39 | Details for Successful Runs
[single_core] M=64 K=64 N=128
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 16 | 26,796 | 1,134.00 | 28.90 | 45.15 | 19.57 | 30.57 | Details for Successful Runs
[single_core] M=64 K=64 N=256
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 32 | 54,842 | 1,134.00 | 28.90 | 45.15 | 19.12 | 29.87 | Details for Successful Runs
[single_core] M=64 K=64 N=64
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 8 | 12,786 | 1,134.00 | 28.90 | 45.15 | 20.50 | 32.04 | Details for Successful Runs
[whole_array] M=128 K=128 N=128
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile3,1 | 16 | 29,403 | 1,150.00 | 28.49 | 44.52 | 17.83 | 27.86 | | tile2,1 | 16 | 29,410 | 1,150.00 | 28.49 | 44.52 | 17.83 | 27.85 | | tile2,2 | 16 | 30,344 | 1,150.00 | 28.49 | 44.52 | 17.28 | 27.00 | | tile3,2 | 16 | 30,345 | 1,150.00 | 28.49 | 44.52 | 17.28 | 27.00 | Details for Successful Runs
[whole_array] M=128 K=128 N=64
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 8 | 13,407 | 1,128.88 | 29.03 | 45.35 | 19.55 | 30.55 | | tile3,1 | 8 | 13,408 | 1,129.25 | 29.02 | 45.34 | 19.55 | 30.55 | | tile3,2 | 8 | 13,960 | 1,150.00 | 28.49 | 44.52 | 18.78 | 29.34 | | tile2,2 | 8 | 15,383 | 1,957.00 | 16.74 | 26.16 | 17.04 | 26.63 | Details for Successful Runs
[whole_array] M=128 K=256 N=128
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,2 | 32 | 57,707 | 1,150.00 | 28.49 | 44.52 | 18.17 | 28.39 | | tile2,1 | 32 | 57,708 | 1,150.00 | 28.49 | 44.52 | 18.17 | 28.39 | | tile3,1 | 32 | 58,037 | 1,150.00 | 28.49 | 44.52 | 18.07 | 28.23 | | tile3,2 | 32 | 58,037 | 1,150.00 | 28.49 | 44.52 | 18.07 | 28.23 | Details for Successful Runs
[whole_array] M=128 K=256 N=64
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,2 | 16 | 27,130 | 1,150.00 | 28.49 | 44.52 | 19.33 | 30.20 | | tile2,1 | 16 | 27,131 | 1,150.00 | 28.49 | 44.52 | 19.32 | 30.19 | | tile3,1 | 16 | 27,879 | 1,150.00 | 28.49 | 44.52 | 18.81 | 29.38 | | tile3,2 | 16 | 27,879 | 1,150.00 | 28.49 | 44.52 | 18.81 | 29.38 | Details for Successful Runs
[whole_array] M=128 K=64 N=128
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,2 | 8 | 12,658 | 1,118.00 | 29.31 | 45.80 | 20.71 | 32.36 | | tile2,1 | 8 | 12,721 | 1,118.00 | 29.31 | 45.80 | 20.61 | 32.20 | | tile3,1 | 8 | 12,786 | 1,134.00 | 28.90 | 45.15 | 20.50 | 32.04 | | tile3,2 | 8 | 13,491 | 1,134.00 | 28.90 | 45.15 | 19.43 | 30.36 | Details for Successful Runs
[whole_array] M=128 K=64 N=64
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,2 | 4 | 5,717 | 1,118.00 | 29.31 | 45.80 | 22.93 | 35.82 | | tile2,1 | 4 | 5,717 | 1,118.00 | 29.31 | 45.80 | 22.93 | 35.82 | | tile3,1 | 4 | 5,781 | 1,134.00 | 28.90 | 45.15 | 22.67 | 35.43 | | tile3,2 | 4 | 5,781 | 1,134.00 | 28.90 | 45.15 | 22.67 | 35.43 | Details for Successful Runs
[whole_array] M=256 K=128 N=128
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,2 | 32 | 58,982 | 1,150.00 | 28.49 | 44.52 | 17.78 | 27.78 | | tile2,1 | 32 | 58,983 | 1,150.00 | 28.49 | 44.52 | 17.78 | 27.78 | | tile3,2 | 32 | 59,499 | 1,150.00 | 28.49 | 44.52 | 17.62 | 27.54 | | tile3,1 | 32 | 59,499 | 1,150.00 | 28.49 | 44.52 | 17.62 | 27.54 | Details for Successful Runs
[whole_array] M=256 K=128 N=64
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 16 | 28,363 | 1,150.00 | 28.49 | 44.52 | 18.48 | 28.88 | | tile3,2 | 16 | 28,363 | 1,150.00 | 28.49 | 44.52 | 18.48 | 28.88 | | tile3,1 | 16 | 28,363 | 1,150.00 | 28.49 | 44.52 | 18.48 | 28.88 | | tile2,2 | 16 | 28,364 | 1,150.00 | 28.49 | 44.52 | 18.48 | 28.88 | Details for Successful Runs
[whole_array] M=256 K=256 N=128
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,2 | 64 | 114,422 | 1,150.00 | 28.49 | 44.52 | 18.33 | 28.64 | | tile2,1 | 64 | 114,422 | 1,150.00 | 28.49 | 44.52 | 18.33 | 28.64 | | tile3,1 | 64 | 115,517 | 1,150.00 | 28.49 | 44.52 | 18.15 | 28.37 | | tile3,2 | 64 | 115,518 | 1,150.00 | 28.49 | 44.52 | 18.15 | 28.37 | Details for Successful Runs
[whole_array] M=256 K=256 N=64
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 32 | 55,488 | 1,150.00 | 28.49 | 44.52 | 18.90 | 29.53 | | tile2,2 | 32 | 55,488 | 1,150.00 | 28.49 | 44.52 | 18.90 | 29.53 | | tile3,1 | 32 | 55,725 | 1,150.00 | 28.49 | 44.52 | 18.82 | 29.40 | | tile3,2 | 32 | 55,725 | 1,150.00 | 28.49 | 44.52 | 18.82 | 29.40 | Details for Successful Runs
[whole_array] M=256 K=64 N=128
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,2 | 16 | 34,695 | 1,150.00 | 28.49 | 44.52 | 15.11 | 23.61 | | tile2,1 | 16 | 34,696 | 1,150.00 | 28.49 | 44.52 | 15.11 | 23.61 | | tile3,1 | 16 | 35,801 | 1,150.00 | 28.49 | 44.52 | 14.64 | 22.88 | | tile3,2 | 16 | 35,802 | 1,150.00 | 28.49 | 44.52 | 14.64 | 22.88 | Details for Successful Runs
[whole_array] M=256 K=64 N=64
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,2 | 8 | 12,940 | 1,150.00 | 28.49 | 44.52 | 20.26 | 31.65 | | tile2,1 | 8 | 12,941 | 1,150.00 | 28.49 | 44.52 | 20.26 | 31.65 | | tile3,2 | 8 | 13,352 | 1,150.00 | 28.49 | 44.52 | 19.63 | 30.68 | | tile3,1 | 8 | 13,352 | 1,150.00 | 28.49 | 44.52 | 19.63 | 30.68 | Details for Successful Runs
[whole_array] M=64 K=128 N=128
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,2 | 8 | 13,573 | 1,150.00 | 28.49 | 44.52 | 19.31 | 30.18 | | tile2,1 | 8 | 13,573 | 1,150.00 | 28.49 | 44.52 | 19.31 | 30.18 | | tile3,1 | 8 | 13,573 | 1,150.00 | 28.49 | 44.52 | 19.31 | 30.18 | | tile3,2 | 8 | 14,627 | 1,150.00 | 28.49 | 44.52 | 17.92 | 28.00 | Details for Successful Runs
[whole_array] M=64 K=128 N=256
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,2 | 16 | 28,364 | 1,150.00 | 28.49 | 44.52 | 18.48 | 28.88 | | tile2,1 | 16 | 28,364 | 1,150.00 | 28.49 | 44.52 | 18.48 | 28.88 | | tile3,1 | 16 | 29,503 | 1,150.00 | 28.49 | 44.52 | 17.77 | 27.77 | | tile3,2 | 16 | 29,510 | 1,150.00 | 28.49 | 44.52 | 17.77 | 27.76 | Details for Successful Runs
[whole_array] M=64 K=128 N=64
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,2 | 4 | 6,178 | 1,150.00 | 28.49 | 44.52 | 21.22 | 33.15 | | tile2,1 | 4 | 6,179 | 1,150.00 | 28.49 | 44.52 | 21.21 | 33.14 | | tile3,2 | 4 | 6,686 | 1,150.00 | 28.49 | 44.52 | 19.60 | 30.63 | | tile3,1 | 4 | 6,694 | 1,150.00 | 28.49 | 44.52 | 19.58 | 30.59 | Details for Successful Runs
[whole_array] M=64 K=256 N=128
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile3,1 | 16 | 29,351 | 1,150.00 | 28.49 | 44.52 | 17.86 | 27.91 | | tile2,1 | 16 | 29,359 | 1,150.00 | 28.49 | 44.52 | 17.86 | 27.90 | | tile2,2 | 16 | 30,307 | 1,150.00 | 28.49 | 44.52 | 17.30 | 27.03 | | tile3,2 | 16 | 30,308 | 1,150.00 | 28.49 | 44.52 | 17.30 | 27.03 | Details for Successful Runs
[whole_array] M=64 K=256 N=256
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile3,1 | 32 | 62,158 | 1,150.00 | 28.49 | 44.52 | 16.87 | 26.36 | | tile2,1 | 32 | 62,165 | 1,150.00 | 28.49 | 44.52 | 16.87 | 26.36 | | tile2,2 | 32 | 62,866 | 1,150.00 | 28.49 | 44.52 | 16.68 | 26.06 | | tile3,2 | 32 | 62,867 | 1,150.00 | 28.49 | 44.52 | 16.68 | 26.06 | Details for Successful Runs
[whole_array] M=64 K=256 N=64
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 8 | 12,953 | 1,150.00 | 28.49 | 44.52 | 20.24 | 31.62 | | tile3,1 | 8 | 12,953 | 1,150.00 | 28.49 | 44.52 | 20.24 | 31.62 | | tile3,2 | 8 | 13,694 | 1,150.00 | 28.49 | 44.52 | 19.14 | 29.91 | | tile2,2 | 8 | 13,702 | 1,150.00 | 28.49 | 44.52 | 19.13 | 29.89 | Details for Successful Runs
[whole_array] M=64 K=64 N=128
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 4 | 5,858 | 1,150.00 | 28.49 | 44.52 | 22.37 | 34.96 | | tile2,2 | 4 | 5,858 | 1,150.00 | 28.49 | 44.52 | 22.37 | 34.96 | | tile3,2 | 4 | 5,858 | 1,150.00 | 28.49 | 44.52 | 22.37 | 34.96 | | tile3,1 | 4 | 5,858 | 1,150.00 | 28.49 | 44.52 | 22.37 | 34.96 | Details for Successful Runs
[whole_array] M=64 K=64 N=256
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile2,1 | 8 | 12,940 | 1,150.00 | 28.49 | 44.52 | 20.26 | 31.65 | | tile3,2 | 8 | 12,940 | 1,150.00 | 28.49 | 44.52 | 20.26 | 31.65 | | tile2,2 | 8 | 12,941 | 1,150.00 | 28.49 | 44.52 | 20.26 | 31.65 | | tile3,1 | 8 | 12,941 | 1,150.00 | 28.49 | 44.52 | 20.26 | 31.65 | Details for Successful Runs
[whole_array] M=64 K=64 N=64
| Tile | Kernels | Total cycles | Avg cycles per kernel | MACs/cycle (kernel) | Peak eff. kernel % | MACs/cycle (system) | Peak eff. system % | |------|---------|--------------|-----------------------|---------------------|--------------------|---------------------|--------------------| | tile3,2 | 2 | 2,317 | 1,150.00 | 28.49 | 44.52 | 28.28 | 44.20 | | tile3,1 | 2 | 2,317 | 1,150.00 | 28.49 | 44.52 | 28.28 | 44.20 | | tile2,2 | 2 | 2,317 | 1,150.00 | 28.49 | 44.52 | 28.28 | 44.20 | | tile2,1 | 2 | 2,317 | 1,150.00 | 28.49 | 44.52 | 28.28 | 44.20 |