0.1.3 (2024-07-31)
Bugfix
- bugfix: Fix cudagraph mode of BatchPrefillWithRaggedKVCacheWrapper (#412) (9907bc)
- fix cu118 cub usage for sampling kernels (#410) (58d359)
Don't miss a new flashinfer release
NewReleases is sending notifications on new releases.