AMD Optimized BLIS Version 2.1
Highlights of improvements on AMD EPYCTM processor family CPUs
- Improved performance of SGEMM and DGEMM for small and skinny size matrices
- Improved TRSM single thread performance for small and skinny size matrices
- BLIS build now supports both AMD "zen" and "zen2" configurations with auto config option
- Support for C++ Template APIs for all BLAS functions