pypi onnxruntime 1.23.1
ONNX Runtime v1.23.1

11 hours ago

What's Changed

  • Fix Attention GQA implementation on CPU (#25966)
  • Address GetMemInfo edge cases (#26021)
  • Implement new Python APIs (#25999)
  • MemcpyFromHost and MemcpyToHost support for plugin EPs (#26088)
  • [TRT RTX EP] Fix bug for generating the correct subgraph in GetCapability (#26132)
  • Add session_id_ to LogEvaluationStart/Stop, LogSessionCreationStart (#25590)
  • [build] fix WebAssembly build on macOS/arm64 (#25653)
  • [CPU] MoE Kernel (#25958)
  • [CPU] Block-wise QMoE kernel for CPU (#26009)
  • [C#] Implement missing APIs (#26101)
  • Regenerate test model with ONNX IR < 12 (#26149)
  • [CPU] Fix compilation errors because of unused variables (#26147)
  • [EP ABI] Check if nodes specified in GetCapability() have already been assigned (#26156)
  • [QNN EP] Add dynamic option to set HTP performance mode (#26135)

Full Changelog: v1.23.0...v1.23.1
