Chapter 7.4 : The intrinsics version of Saxpy
- 7.4.1) The main_intrinsics.cpp
- 7.4.2) The CMakeLists.txt
- 7.4.3) The compilation
- 7.4.4) The performances
During this section, we will use :
- Inclusion of file immintrin.h
- Intrinsic function : _mm256_load_ps
- Intrinsic function : _mm256_store_ps
- Intrinsic function : _mm256_mul_ps
- Intrinsic function : _mm256_add_ps
- Intrinsic function : _mm256_broadcast_ss (to duplicate a float 8 times in a vectorial register)
- Enable specific optimisations with -O3 -march=native -mtune=native -mavx2