github libxsmm/libxsmm 0.9.1
Version 0.9.1

8 years ago

This is mainly a bug-fix release: it corrects the AVX-512 code for N=9 when K is a multiple of 16 (DP) or 32 (SP). In addition, the samples (blas, dispatched, inlined, and specialized) are consolidated into a single sample folder, which also comes with a performance evaluation script (a run script and a Gnuplot script). The more complex "cp2k" code sample has been renamed as well and comes with slightly improved Gnuplot scripts.
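
To judge whether a workload was exposed to the corrected AVX-512 issue, the stated condition can be written as a small check. This is only a sketch of the condition as worded in these notes: the function name and the `precision` labels are hypothetical, it is not part of libxsmm, and it assumes there are no further constraints (for example on M) beyond those mentioned above.

```python
def matches_fixed_avx512_case(n: int, k: int, precision: str) -> bool:
    """Return True if (n, k) matches the shape class named in the 0.9.1 notes.

    Hypothetical helper, not a libxsmm API. It encodes only the stated
    condition: N == 9 and K a multiple of 16 (DP) or 32 (SP).
    """
    # K granularity per precision, taken verbatim from the release notes.
    k_multiple = {"DP": 16, "SP": 32}
    if precision not in k_multiple:
        raise ValueError("precision must be 'DP' or 'SP'")
    return n == 9 and k > 0 and k % k_multiple[precision] == 0


if __name__ == "__main__":
    # Print the result for a few example shapes.
    for n, k, prec in [(9, 16, "DP"), (9, 16, "SP"), (9, 64, "SP"), (8, 32, "DP")]:
        print(n, k, prec, matches_fixed_avx512_case(n, k, prec))
```

Shapes for which the check returns True are the ones worth re-validating after upgrading from an earlier release.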
