Chapter 10.6 : Intrinsics implementation with a pitch
- 10.6.1) The sgemm_intrinsics_pitch.h file
- 10.6.2) The sgemm_intrinsics_pitch.cpp file
- 10.6.3) The main_sgemm_intrinsics_pitch.cpp file
- 10.6.4) The CMakeLists.txt file
- 10.6.5) The compilation
- 10.6.6) The performances
The simpler solution is to add elements at the end of each row to ensure the next one is aligned on a vectorial register size too.