github flashinfer-ai/flashinfer v0.0.9

latest releases: nightly-v0.6.11-20260517, nightly-v0.6.11-20260516, v0.6.11.post3...
22 months ago

0.0.9 (2024-07-12)

Bugfix

  • fix the decode kernel segfault in cudagraph mode (#368)(c69cfa)
  • fix decode kernels output for empty kv cache (#363)(ac72b1)
  • check gpu id in PyTorch APIs and use input tensor's gpu default stream (#361)(1b84fa)

Performance Improvements

Acknowledgement

We thank @Yard1, @Ying1123 and @zhyncs for their contributions.

Don't miss a new flashinfer release

NewReleases is sending notifications on new releases.