What's Changed
- Fix quant predicate by @awni in #485
- Fix passing argument model_config in utils.load() by @ariahw in #494
- Fix KV cache quantization for hybrid models by @awni in #495
- Fix for LFM2 by @awni in #493
- Fix loading for Qwen2 VL by @awni in #491
- Enable training for Qwen3 Next by @awni in #496
- Add batch support for sliding window cache by @awni in #487
- Qwen3 Next batching by @awni in #478
- Add Falcon H1 by @Blaizzy in #231
- Fix RotatingKVCache update by @awni in #503
- Add Code World Model support by @dnakov in #505
- Use depends in pipeline parallel by @awni in #483
New Contributors
Full Changelog: v0.28.0...v0.28.1