What's Changed
- Fix Attention GQA implementation on CPU (#25966)
- Address edge GetMemInfo edge cases (#26021)
- Implement new Python APIs (#25999)
- MemcpyFromHost and MemcpyToHost support for plugin EPs (#26088)
- [TRT RTX EP] Fix bug for generating the correct subgraph in GetCapability (#26132)
- add session_id_ to LogEvaluationStart/Stop, LogSessionCreationStart (#25590)
- [build] fix WebAssembly build on macOS/arm64 (#25653)
- [CPU] MoE Kernel (#25958)
- [CPU] Block-wise QMoE kernel for CPU (#26009)
- [C#] Implement missing APIs (#26101)
- Regenerate test model with ONNX IR < 12 (#26149)
- [CPU] Fix compilation errors because of unused variables (#26147)
- [EP ABI] Check if nodes specified in GetCapability() have already been assigned (#26156)
- [QNN EP] Add dynamic option to set HTP performance mode (#26135)
Full Changelog: v1.23.0...v1.23.1