0.1.2 (2024-07-29)
Bugfix
Features
- add llama 3.1 style rope (#401) (4c89dec)
- non-inplace rope operators (#405) (74ffba1)
- sliding window attention (#406) (28cffd3)
- support non-contiguous (packed) input for prefill kernels (#404) (68c3719)
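
For readers unfamiliar with the "llama 3.1 style rope" item above, here is a minimal pure-Python sketch of the frequency scaling commonly associated with Llama 3.1. It is not flashinfer's kernel code and does not use the flashinfer API. The function names (`rope_inv_freq`, `llama31_scale_inv_freq`) are hypothetical, and the default parameters (`factor=8`, `low_freq_factor=1`, `high_freq_factor=4`, `old_context_len=8192`, `theta=500000`) are assumed from the public Llama 3.1 model configuration, not taken from this release.

```python
import math


def rope_inv_freq(head_dim, theta=500000.0):
    """Standard RoPE inverse frequencies, one per pair of dimensions."""
    return [theta ** (-2.0 * i / head_dim) for i in range(head_dim // 2)]


def llama31_scale_inv_freq(inv_freq, factor=8.0, low_freq_factor=1.0,
                           high_freq_factor=4.0, old_context_len=8192):
    """Rescale RoPE frequencies in the Llama 3.1 style (illustrative sketch).

    High-frequency components are kept, low-frequency components are
    divided by `factor`, and the band in between is interpolated.
    """
    low_freq_wavelen = old_context_len / low_freq_factor
    high_freq_wavelen = old_context_len / high_freq_factor
    scaled = []
    for f in inv_freq:
        wavelen = 2.0 * math.pi / f
        if wavelen < high_freq_wavelen:
            # Short wavelength: leave the frequency unchanged.
            scaled.append(f)
        elif wavelen > low_freq_wavelen:
            # Long wavelength: stretch by the full scaling factor.
            scaled.append(f / factor)
        else:
            # Medium band: blend linearly between the two regimes.
            smooth = (old_context_len / wavelen - low_freq_factor) / (
                high_freq_factor - low_freq_factor)
            scaled.append((1.0 - smooth) * f / factor + smooth * f)
    return scaled


inv_freq = rope_inv_freq(head_dim=128)
scaled = llama31_scale_inv_freq(inv_freq)
```

With these assumed defaults, the highest-frequency component is left untouched and the lowest-frequency component is divided by 8, which is what lets the rotary embedding cover a longer context than the original 8192 positions.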