What's Changed
- Re-enable fusion f16 conv + bn regression tests (#4920) @laggui
- Enable & fix cubecl tests w/ fusion (#4917) @laggui
- Fusion tests (#4872) @nathanielsimard
- Adding complex to complex FFT implementation (#4903) @RunjiaChen
- Add cubecl integration to topk (#4906) @Sublime12
- Extract float math ops in a new trait (#4891) @skewballfox
- Add ParamId::try_deserialize() (#4881) @crutcher
- Add Clone + 'static bounds to LrScheduler::Record and derive Clone for scheduler records (#4905) @crutcher
- linalg::lu: Improve numerical handling and small perf cleanup (#4902) @softmaximalist
- Add fusion integration for argtopk (#4904) @Sublime12
- Add argtopk for Cubecl backend (#4900) @Sublime12
- Update CubeK: tile matmul refactor (#4901) @louisfd
- Use gather_nd in RNN-T gather_loss (#4895) @antimora
- Fix cubecl cross product on non-last dimension (#4850) @dschulmeist
- Fix PytorchReader bugs to load legacy files correctly (#4897) @softmaximalist
- Add native implementations for scatter_nd / gather_nd; provide autodiff for assign & add (#4709) @cu9hue
Full Changelog: v0.21.0-pre.4...v0.21.0-pre.5