0.0.4 (2024-05-01)
Features
- PyTorch 2.3 support
- more GQA group sizes
- add mma instructions for fp8 (#179) (d305798)
- mma rowsum for fp8 (#180) (5af935c)
- support any num_heads for get_alibi_slope (#200) (b217a6f)
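
For context on the last item: ALiBi slopes form a simple geometric sequence when the head count is a power of two, and other head counts need an extension rule. The sketch below follows the interleaving scheme from the original ALiBi reference code. It is an illustration only, not FlashInfer's kernel code; the name `alibi_slopes` is hypothetical, and whether `get_alibi_slope` uses exactly this scheme is an assumption.

```python
def alibi_slopes(num_heads: int) -> list[float]:
    """Return one ALiBi slope per head for any positive head count.

    Hypothetical helper following the ALiBi reference scheme;
    not part of the FlashInfer API.
    """
    if num_heads < 1:
        raise ValueError("num_heads must be positive")

    def power_of_two_slopes(n: int) -> list[float]:
        # Geometric sequence 2^(-8/n), 2^(-16/n), ..., 2^(-8).
        base = 2.0 ** (-8.0 / n)
        return [base ** (i + 1) for i in range(n)]

    # Largest power of two that does not exceed num_heads.
    closest = 1 << (num_heads.bit_length() - 1)
    slopes = power_of_two_slopes(closest)

    if closest < num_heads:
        # Fill the remaining heads with every other slope from the
        # sequence for twice as many heads.
        extra = power_of_two_slopes(2 * closest)[0::2]
        slopes += extra[: num_heads - closest]
    return slopes


if __name__ == "__main__":
    print(alibi_slopes(8))
    print(alibi_slopes(12))
```

With 12 heads, the first 8 slopes match the 8-head sequence and the remaining 4 are taken from the 16-head sequence.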