Fixed Issues / Improvements
- Keep Fast Math Flags during memory operations simplifications.
- Allow float to packed half-float move on select platforms, second try.
- Fix handling saturation patterns.
- Force private memory to global buffer when generic load/store are present
- Optionally allow for compilation without payload header.
- Fix bug with setting of global variable in kernel arg offsets.
- Fix right bound computation for send destination.
- Fix in NoMask WA for the last BB.
- Change unroll threshold for high trip count, nested loops.
- Support for SPV_INTEL_noopt in OCL adaptor.
- Fix bugs in expandMulPostSchedule pass.
- Other minor fixes and improvements.
Dependencies revisions
- intel/opencl-clang@c8cd72e
- KhronosGroup/SPIRV-LLVM-Translator@424e375 (for opencl-clang)
- intel/vc-intrinsics@6713229
- KhronosGroup/SPIRV-LLVM-Translator@9d8d032 (for VectorCompiler)
- llvm/llvm-project@llvmorg-10.0.0
Ubuntu 18.04 binary packages for LLVM10/Clang10 are included.