Notes:
- This release contains new implementations of the implicitGEMM and Winograd algorithms, performance improvements for convolutions, further support for 3D convolutional networks, and various bug fixes.
Changes:
- Added 3D Pooling layers
- Added backwards data algorithm for implicitGEMM
- Added GEMM performance improvements via relaxed constraints in rocBLAS-Tensile
- Added full CO v3 support for all kernels in MIOpen
- Added new Winograd group convolution kernels
- Added an API to query MIOpen's version
- Added parallel compilation in initial convolutional algorithm search; partial solution to #130
- Added SQLite binary program cache
- Improved logging across all layers
- Improved MIOpen's internal design for calling convolutional solvers
- Fixed various bugs for the implicitGEMM algorithm