2.1.40+xpu
We are excited to announce the release of Intel® Extension for PyTorch* v2.1.40+xpu. This is a minor release which supports Intel® GPU platforms (Intel® Data Center GPU Flex Series, Intel® Data Center GPU Max Series, Intel® Arc™ A-Series Graphics and Intel® Core™ Ultra Processors with Intel® Arc™ Graphics) based on PyTorch* 2.1.0.
Highlights
- Intel® oneAPI Base Toolkit 2024.2.1 compatibility
- Intel® oneDNN v3.5 integration
- Intel® oneCCL 2021.13.1 integration
- Intel® Core™ Ultra Processors with Intel® Arc™ Graphics (MTL-H) support on Windows (Prototype)
- Bug fixing and other optimization
- Fix host memory leak #4280
- Fix LayerNorm issue for undefined grad_input #4317
- Replace FP64 device check method #4354
- Fix online doc search issue #4358
- Fix pdist unit test failure on client GPUs #4361
- Remove primitive cache from conv fwd #4429
- Fix sdp bwd page fault with no grad bias #4439
- Fix implicit data conversion #4463
- Fix compiler version parsing issue #4468
- Fix irfft invalid descriptor #4480
- Change condition order to fix out-of-bound access in index #4495
- Add parameter check in embedding bag #4504
- Add the backward implementation for rms norm #4527
- Fix attn_mask for sdpa beam_search #4557
- Use data_ptr template instead of force data conversion #4558
- Workaround windows AOT image size over 2GB issue on Intel® Core™ Ultra Processors with Intel® Arc™ Graphics #4407 #4450
Known Issues
Please refer to Known Issues webpage.