Dao-AILab/flash-attention fa4-v4.0.0.beta15 on GitHub

What's Changed

Wrap mask construction in a function for mask subclassing by @sryap in #2584
Build Fix: Update abi3 tag to cp310 and minimum python version to 3.10 by @aw920h in #2532
[Cute,Flex,Sm100] vectorized mask_mod by @reubenconducts in #2261
[CuTe, SM103] Update architecture assertion for SM 10.x and 11.x by @ocss884 in #2572
Include sm_110 in Blackwell-family arch gating (follow-up to #2572) by @Johnsonms in #2590
Use is_family_of for sm_90 and sm_103 arch checks by @Johnsonms in #2589

Full Changelog: fa4-v4.0.0.beta14...fa4-v4.0.0.beta15