- Support for extended subgroup functions.
- Improve IGCConstProp pass to support constant folding in rounding-mode-specific GenISA intrinsics.
- Add sat and overflow operation support for add and sub.
- Fixes realted to cl_khr_fp16 and cl_khr_fp64 types.
- Introduce inline asm support for SPIRV reader.
- Handling of splitting of rare regions like r[a0.0]<2,0> correctly.
- Small CMake path change
- Refactor IEEE FP32 divide/sqrt macro to avoid hard-coded execution size.
- Fixing the implementations of FENCE_PRE_OP and FENCE_POST_OP. The OCL’s GPU memory model problem was originally identified and reported by Tyler Sorensen and Jake Kirkham from Princeton University.
- Add address register initialization on ICL platform.
- Implement scratch space allocation for CMC thread-private memory.
- Increase OCL inlining threshold to 4500 to resolve the performance regression reported.
- And other bug fixes.
Ubuntu 18.04 binary packages for LLVM9/Clang9 are included.