This is the Beta Update 3 release of oneDNN Graph API based on oneDNN v3.0.1.
Performance Optimizations
- Improved multi-level perceptron (MLP) and residual block subgraphs performance with oneDNN Graph Compiler backend on 4th generation Intel Xeon Scalable processors (formerly Sapphire Rapids).
- Improved dynamic shape performance for MLP and multi-head attention (MHA) patterns with oneDNN Graph Compiler backend.
- Improved performance of oneDNN Graph Compiler built-in code generator.
Functionality
- Extended the set of multi-head attention (MHA) variants supported by oneDNN Graph Compiler.
Known Issues and Limitations
- The weight’s opaque layout can be queried only from a compiled partition.
Thanks to the Contributors
This release contains contributions from the project core teams as well as Jiong Gong, Chunyuan Wu, Sanchit Jain, Yiqiang Li, Yunfei Mao, Kiefer Kuah and others.