What's Changed
- custom dsv32 chat template by @awni in #693
- shard glm by @awni in #698
- support minimax m2 by @awni in #700
- Enhance load_config function to check for config file existence and i… by @cubist38 in #701
- batch_generate fails with Phi3 (LongRoPE) when prompts have different lengths by @vyaivanove in #707
- Fix GIL starvation in _generate thread when batch is idle by @sjug in #706
- Ignore generation_config decode errors by @will-lms in #708
- Allow mxfp8 and nvfp4 by @awni in #709
- Fix chat template detection for models with custom tokenizers by @kernelpool in #712
- chore: add model-path param flag for convert API for better clarity by @jaycoolslm in #702
- Add RWKV7 by @MollySophia in #580
- Fix empty /v1/models response for locally loaded models by @cxl-git-hub in #713
- Add IQuest Coder V1 by @kernelpool in #714
- Add YoutuLLM by @johnmai-dev in #720
- Add logits_processors support to batch_generate by @lazarust in #635
- Add Solar Open by @kernelpool in #721
- Add K-EXAONE MoE by @kernelpool in #719
- Improve reasoning and tool call parsing in server by @awni in #711
- Patch bump by @awni in #731
New Contributors
- @cubist38 made their first contribution in #701
- @vyaivanove made their first contribution in #707
- @sjug made their first contribution in #706
- @jaycoolslm made their first contribution in #702
- @MollySophia made their first contribution in #580
- @cxl-git-hub made their first contribution in #713
- @lazarust made their first contribution in #635
Full Changelog: v0.30.0...v0.30.1