github libxsmm/libxsmm 1.6.4
Version 1.6.4

latest releases: 1.old_kernelapi_rip, 1.libxsmm_dnn_rip, 1.eol...
7 years ago

This is a maintenance release with improvements for GCC and Clang compilers (function-level target compilation, and intrinsics support). The function-level target compilation is a prerequisite for good performance due to CPUID-dispatched code paths. Moreover and in preparation of v1.7, there are breaking changes in the DNN domain (buffer management is now an external responsibility). An API for logical padding has been added (DNN domain). In addition to our Travis CI, an improved test coverage for a variety of compiler versions is now in place.

INTRODUCED

  • SPMDM: introduced CPUID-dispatched code paths
  • SPMDM: support for transposing C

CHANGES

  • No distinction between SSE 4.1/4.2 (new enum LIBXSMM_X86_SSE4, removed LIBXSMM_X86_SSE4_*)
  • DNN: removed create_buffer and create_filter functions since buffers are provided externally
  • DNN: updated googlenetv1 script to match googlenetv1 description
  • DNN: initial changes to support logical input padding
  • DNN: improved performance of weight update
  • DNN: new padding frontend API

FIXES

  • Fixed intrinsic layer for reliable target compilation (function level), and clean switches for legacy compilers, included FMA flag when targeting AVX2 on GCC and Clang
  • DNN: fix in image parallel forward convolution when 2d register blocking is used
  • DNN: fixed physical input padding for backward and weight update (all format combinations)
  • DNN: fixed physical padding in the fallback code path
  • DNN: fixed some corner case prefetching bug
  • SPMDM: fixed library initialization

Don't miss a new libxsmm release

NewReleases is sending notifications on new releases.