This is a patch release for ONNX Runtime 1.24, containing bug fixes and execution provider updates.
Bug Fixes
- Core: Added PCI bus fallback for Linux GPU device discovery in containerized environments (e.g., AKS/Kubernetes) where nvidia-drm is not loaded but GPU PCI devices are still exposed via sysfs. (#27591)
- Plugin EP: Fixed null pointer dereference when iterating output spans in GetOutputIndex. (#27644)
- Plugin EP: Fixed bug that incorrectly assigned duplicate MetaDef IDs to fused nodes in different GraphViews (e.g., then/else branches of an If node), causing session creation to fail with a conflicting kernel error. (#27666)
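
To illustrate the idea behind the PCI bus fallback in the Core fix above, here is a minimal Python sketch of enumerating GPU devices directly from sysfs. It is not the ONNX Runtime implementation (which is C++); the function name `find_gpu_pci_devices`, the default path, and the choice to filter on the NVIDIA vendor ID and the display-controller base class are all assumptions made for illustration.

```python
from pathlib import Path

NVIDIA_VENDOR_ID = 0x10DE  # PCI vendor ID assigned to NVIDIA
DISPLAY_BASE_CLASS = 0x03  # PCI base class for display controllers


def find_gpu_pci_devices(sysfs_root="/sys/bus/pci/devices"):
    """Return PCI addresses of NVIDIA display-class devices under sysfs_root.

    Hypothetical helper: it only reads the per-device `vendor` and `class`
    files, so it does not depend on any DRM driver node being present.
    """
    root = Path(sysfs_root)
    if not root.is_dir():
        return []  # no sysfs PCI tree available (e.g. non-Linux host)

    found = []
    for dev in sorted(root.iterdir()):
        try:
            vendor = int((dev / "vendor").read_text().strip(), 16)
            pci_class = int((dev / "class").read_text().strip(), 16)
        except (OSError, ValueError):
            continue  # skip entries without readable vendor/class files
        # The class value is 24 bits: base class, subclass, programming interface.
        if vendor == NVIDIA_VENDOR_ID and (pci_class >> 16) == DISPLAY_BASE_CLASS:
            found.append(dev.name)
    return found
```

Calling `find_gpu_pci_devices()` inside a container lists any matching PCI addresses that sysfs exposes; an empty list means no matching device is visible there.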
Execution Provider Updates
- QNN EP: Enabled offline x64 compilation with memhandle IO type by deferring rpcmem library loading to inference time. (#27479)
- QNN EP: Reverted QNN SDK logging verbosity changes that caused segmentation faults on backend destruction. (#27650)
Build and Infrastructure
- Python: Updated python_requires from >=3.10 to >=3.11 to reflect dropped Python 3.10 support. (#27354)
- Build: Replaced __builtin_ia32_tpause with the compiler-portable _tpause intrinsic to fix cross-compiler portability issues between GCC and LLVM. (#27607)
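
For the python_requires change above, a small sketch of checking whether an interpreter meets the new floor before installing. The helper name `meets_minimum_python` and the constant `MIN_PYTHON` are hypothetical and not part of the onnxruntime package; the only value taken from the release note is the >=3.11 minimum.

```python
import sys

MIN_PYTHON = (3, 11)  # mirrors the >=3.11 floor stated in the release note


def meets_minimum_python(version=None, minimum=MIN_PYTHON):
    """Return True if `version` (default: the running interpreter) is >= minimum.

    Only the major and minor components are compared, which matches how a
    `>=3.11` specifier treats any 3.11.x release.
    """
    if version is None:
        version = sys.version_info
    return tuple(version[:2]) >= tuple(minimum)


if __name__ == "__main__":
    if not meets_minimum_python():
        print("This interpreter is older than Python 3.11.")
```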
Full Changelog: v1.24.3...v1.24.4
Contributors
@derdeljan-msft, @adrianlizarraga, @apwojcik, @baijumeswani, @edgchen1, @mocknen, @tianleiwu, @XXXXRT666