This is a patch release containing the following changes to v3.3.1:
- Fixed incorrect results in bfloat16 reorder on Intel Core Ultra integrates GPUs (9025980, ed9de2a, 0c6bda1)
- Fixed incorrect results in matmul, inner product, and RNN primitives on Intel Core Ultra integrated GPUs (6edab9f)
- Updated compiler optimization flags for AArch64 processors to make build portable (8829c24)
- Fixed segmentation fault during library initialization on AArch64 processors (3e15c61)