github NVIDIA/cutlass v4.5.2
CUTLASS 4.5.2

7 hours ago

CuTe DSL

  • New features

    • Python 3.14t is now supported with GIL enabled
  • Bug fixing and improvements

CUTLASS C++

  • Fix missing convert fucntion in EVT for fp4 kernels.
  • Avoid instantiate 2sm tma kernels where ctaN is none power of 64 when ctaN > 128 in profiler.

Don't miss a new cutlass release

NewReleases is sending notifications on new releases.