github NVIDIA/cutlass v4.3.1
CUTLASS 4.3.1

3 hours ago

CuTe DSL

  • New features
    • Added Blackwell SM103 support
    • Multiple dependent DSOs in the wheel have been merged into one single DSO
  • Bug fixing and improvements
    • Fixed device reset issue with tvm-ffi
    • Fixed tvm-ffi export compiled function

CUTLASS C++

  • Support blockscaled variant of ragged contiguous grouped gemm with the new simplified MoE API in example 92.
    • The new example works for all microscaling types.

Don't miss a new cutlass release

NewReleases is sending notifications on new releases.