This is the Alpha release for oneDNN Graph API based on oneDNN v2.6 release.
Functionality
-
Introduced FP32 and BF16 training support on CPU.
-
Introduced multiple layer perceptron (MLP) fusion supported by oneDNN Graph compiler with optimized code generation (experimental).
-
Updated API to comply with oneDNN Graph API specification v1.0-alpha.
Known Issues and Limitations
-
The weight’s opaque layout can be queried only from a compiled partition, which requires that input tensor shapes must be known at compilation time.
-
MHA and MLP fusion are not activated on machines without AVX-512 support, as oneDNN Graph compiler generates AVX-512 and newer instructions.
Thanks to the Contributors
This release contains contributions from the project core teams as well as Jiong Gong, Chunyuan Wu, Sanchit Jain, Yiqiang Li, Yunfei Mao, Kiefer Kuah and others.