What's Changed
- Fix gemma3n short prompts by @Blaizzy in #751
- Adding full weight finetuning by @Goekdeniz-Guelmez in #499
- Initialize qwen3_5_moe.LanguageModel with _position_ids by @will-lms in #753
- Fix batch inference to use InputEmbeddingsFeatures by @Blaizzy in #760
- fix(qwen3_vl): reset _position_ids when new image/video arrives by @kol22 in #756
- fix: replace 13 bare excepts with except Exception by @haosenwang1018 in #765
- Reset position ids for Qwen3-VL-MoE image inputs by @kol22 in #761
- Fix text only single batch by @Blaizzy in #771
- Update LORA.md for the new trainer by @Goekdeniz-Guelmez in #775
- Support /v1/ prefix for OpenAI compatible endpoints by @spicyneuron in #783
- CORS middleware by @viktike in #766
- fx: README curl error by @yinzhidong in #788
- Streaming chatml response enhancements. by @viktike in #764
- Add KV cache (quantization) parameters to server. by @viktike in #776
- Fix mllama training and add more models in mapp by @Goekdeniz-Guelmez in #777
- Add thinking budget and flag by @Blaizzy in #789
- fix: add import guard for gradio in chat_ui entry point by @RtYkk in #787
- Tool calling in server by @viktike in #773
- Add Minicpm-o-2.5 by @Blaizzy in #791
- Add prefill-step-size optional command line argument for server. by @viktike in #792
- qwen3_omni_moe: fix visual_embeds_multiscale UnboundLocalError by @ronaldseoh in #794
- Add Phi-4-reasoning-vision-15B (phi4-siglip) by @Blaizzy in #796
- Add phi4mm by @Blaizzy in #797
- Add ensure_fused_sdpa function for optimized attention computation by @Blaizzy in #800
- [Internvl_chat] Add optional kwargs parameter to LanguageModel by @Blaizzy in #801
- Guard load img & audio by @Blaizzy in #803
- [Qwen3.5 MoE] Fix quant predicate by @Blaizzy in #804
- Adding orpo by @Goekdeniz-Guelmez in #795
- Add moondream3 by @Blaizzy in #807
- Remove sleep duration with zero delay by @Blaizzy in #808
- Update dependencies and version number by @Blaizzy in #809
New Contributors
- @kol22 made their first contribution in #756
- @haosenwang1018 made their first contribution in #765
- @spicyneuron made their first contribution in #783
- @viktike made their first contribution in #766
- @yinzhidong made their first contribution in #788
- @RtYkk made their first contribution in #787
- @ronaldseoh made their first contribution in #794
Full Changelog: v0.3.12...v0.4.0