What's Changed
- Version bump by @angeloskath in #651
- Fix slow batch generation in server by setting wired_limit by @otarkhan in #652
- Fix RoPE for rnj-1 by @awni in #657
- Fix: call the correct dequantize function by @devnamrits in #666
- Use test data zipfile in CI by @awni in #662
- Default repetition penalty to 0.0 in the server by @awni in #658
- Fix DSV32 and Gemma3 by @awni in #664
- Fix fusion and test by @awni in #668
- Fix server batching condition for SSMs by @angeloskath in #655
- Fix SuScaledRoPE by @DePasqualeOrg in #660
- Fix DSV32 by @awni in #669
- Fix for Devstral-2 by @inferencers in #671
- Support Nemotron 3 by @awni in #678
New Contributors
- @otarkhan made their first contribution in #652
- @devnamrits made their first contribution in #666
- @DePasqualeOrg made their first contribution in #660
- @inferencers made their first contribution in #671
Full Changelog: v0.28.4...v0.29.0