ONNX Runtime Release 1.25.1
📢 Announcements & Breaking Changes
ONNX Op Updates
- Enhanced ONNX operator support with new opset versions: Reshape (opset 25), Transpose (opset 24) (#27752)
✨ New Features
📊 New ONNX Ops & Model Support
- LinearAttention and CausalConvState operators for Qwen3.5 model support (#27907)
- RotaryEmbedding (RotEMB) and RMSNorm operators added (#27752)
- Linear Attention signature support (#27842)
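
For readers unfamiliar with RMSNorm, the following is a minimal reference sketch of the math the operator is generally understood to compute: each element is divided by the root mean square of the vector, then multiplied by a learned scale. It is not ONNX Runtime's kernel. The function name `rms_norm`, the list-based inputs, and the `epsilon` default are assumptions chosen for illustration.

```python
import math


def rms_norm(x, scale, epsilon=1e-5):
    """Illustrative RMSNorm over a single 1-D vector (hypothetical helper).

    Computes y[i] = x[i] / sqrt(mean(x^2) + epsilon) * scale[i].
    The epsilon default is an assumption, not a value taken from the release.
    """
    if len(x) != len(scale):
        raise ValueError("x and scale must have the same length")
    # Mean of squared elements over the normalized axis.
    mean_square = sum(v * v for v in x) / len(x)
    # Reciprocal root mean square, stabilized by epsilon.
    inv_rms = 1.0 / math.sqrt(mean_square + epsilon)
    return [v * inv_rms * s for v, s in zip(x, scale)]


normalized = rms_norm([1.0, 2.0, 3.0, 4.0], [1.0, 1.0, 1.0, 1.0])
print(normalized)
```

With a unit scale, the output vector has a mean square close to 1, which is a quick sanity check when comparing against an operator's output.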
🌐 Web & JavaScript
WebGPU EP
- Qwen3.5 model support on WebGPU execution provider (#27996)
- QMoE 1-token decode path optimization: fused operations reduce GPU dispatches and improve performance (#27998)
🐛 Bug Fixes
Core Runtime Fixes
- Improved filesystem error messages during Linux device discovery for a better debugging experience (#27289)
- Fixed missing include for SetRawDataInTensorProto in NVIDIA TensorRT RTX tests (#28065)
🙏 Contributors
Thanks to our 7 contributors for this release:
@guschmue, @sanaa-hamel-microsoft, @apsonawane, @eserscor, @ishwar-raut1, @qjia7, @theHamsta
Full Changelog: v1.25.0...v1.25.1