This is a patch release for ONNX Runtime 1.24, containing several bug fixes, security improvements, and execution provider updates.
Bug Fixes
- NuGet: Fixed native library loading issues in the ONNX Runtime NuGet package on Linux and macOS. (#27266)
- macOS: Fixed Java support and Jar testing on macOS ARM64. (#27271)
- Core: Enabled robust symlink support for external data in the Hugging Face Hub cache. (#27374)
- Core: Added boundary checks for SparseTensorProtoToDenseTensorProto to improve robustness. (#27323)
- Security: Fixed an out-of-bounds read vulnerability in ArrayFeatureExtractor. (#27275)
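
The two bounds-related fixes above address the same class of bug: an index taken from model data is used to read or write a buffer without first checking that it is in range. The sketch below illustrates that pattern in plain Python. It is not the ONNX Runtime patch; the function names `gather_features` and `sparse_to_dense` and their parameters are hypothetical and chosen only for illustration.

```python
def gather_features(row, indices):
    """Return row[i] for each i in indices, rejecting out-of-range indices.

    Hypothetical stand-in for an index-gather operator. Negative indices are
    rejected explicitly, because Python would otherwise wrap them silently.
    """
    n = len(row)
    for i in indices:
        if not 0 <= i < n:
            raise IndexError(f"index {i} is out of range for {n} features")
    return [row[i] for i in indices]


def sparse_to_dense(values, flat_indices, dense_size, fill=0.0):
    """Expand (values, flat_indices) into a dense list of length dense_size.

    Hypothetical stand-in for a sparse-to-dense conversion. Every index is
    validated against dense_size before it is used for a write.
    """
    if len(values) != len(flat_indices):
        raise ValueError("values and flat_indices must have the same length")
    dense = [fill] * dense_size
    for value, i in zip(values, flat_indices):
        if not 0 <= i < dense_size:
            raise IndexError(f"index {i} is out of range for size {dense_size}")
        dense[i] = value
    return dense
```

In both helpers the check runs before any element is accessed, so a malformed input fails with an error rather than reading or writing outside the buffer.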
Execution Provider Updates
- MLAS: Fixed flakiness and accuracy issues in Lut GEMM (MatMulNBitsLutGemm). (#27216)
- QNN: Enabled 64-bit UDMA mode for HTP target v81 or above. (#26677)
- WebGPU:
- OpenVINO (OVEP): Reduced resident memory by reusing weight files across shared contexts. (#27238)
- DNNL: Fixed DNNL build error by including missing files. (#27334)
Build and Infrastructure
- CUDA:
- Pipelines:
- Testing: Updated BaseTester to support plugin EPs with both compiled nodes and registered kernels. (#27176)
- Telemetry: Added service name and framework name to telemetry events for better usage understanding on Windows. (#27252, #27256)
Full Changelog: v1.24.1...v1.24.2
Contributors
@tianleiwu, @hariharans29, @edgchen1, @xiaofeihan1, @adrianlizarraga, @angelser, @angelserMS, @ankitm3k, @baijumeswani, @bmehta001, @ericcraw, @eserscor, @fs-eire, @guschmue, @mc-nv, @qjia7, @qti-monumeen, @titaiwangms, @yuslepukhin