github NVIDIA/cutlass v4.2.1
CUTLASS 4.2.1

19 hours ago

CuTe DSL

  • Bug fixings and improvements
    • Fixed an issue when running DSL codes with cuda-python 13.0
    • Fixed an issue when running inductor with DSL codes
    • Fixed an issue with unexpected logging when running DSL codes in FlashInfer
    • Fixed the issue reported in #2647
    • Fixed an issue when conditional define of variables outside of dynamic control flow

CUTLASS C++

  • Bypass EVT for nosmem blockwise kernels on Blackwell.
  • Rename cutlass/python/cutlass directory to cutlass/python/cutlass_cppgen.

Don't miss a new cutlass release

NewReleases is sending notifications on new releases.