github Dao-AILab/flash-attention fa4-v4.0.0.beta15

pre-release4 hours ago

What's Changed

  • Wrap mask contruction in a function for mask subclassing by @sryap in #2584
  • Build Fix: Update abi3 tag to cp310 and minimum python version to 3.10 by @aw920h in #2532
  • [Cute,Flex,Sm100] vectorized mask_mod by @reubenconducts in #2261
  • [CuTe, SM103] Update architecture assertion for SM 10.x and 11.x by @ocss884 in #2572
  • Include sm_110 in Blackwell-family arch gating (follow-up to #2572) by @Johnsonms in #2590
  • Use is_family_of for sm_90 and sm_103 arch checks by @Johnsonms in #2589

New Contributors

Full Changelog: fa4-v4.0.0.beta14...fa4-v4.0.0.beta15

Don't miss a new flash-attention release

NewReleases is sending notifications on new releases.