Blaizzy/mlx-vlm v0.4.0 on GitHub

What's Changed

Fix gemma3n short prompts by @Blaizzy in #751
Adding full weight finetuning by @Goekdeniz-Guelmez in #499
Initialize qwen3_5_moe.LanguageModel with _position_ids by @will-lms in #753
Fix batch inference to use InputEmbeddingsFeatures by @Blaizzy in #760
fix(qwen3_vl): reset _position_ids when new image/video arrives by @kol22 in #756
fix: replace 13 bare excepts with except Exception by @haosenwang1018 in #765
Reset position ids for Qwen3-VL-MoE image inputs by @kol22 in #761
Fix text only single batch by @Blaizzy in #771
Update LORA.md for the new trainer by @Goekdeniz-Guelmez in #775
Support /v1/ prefix for OpenAI compatible endpoints by @spicyneuron in #783
CORS middleware by @viktike in #766
fx: README curl error by @yinzhidong in #788
Streaming chatml response enhancements. by @viktike in #764
Add KV cache (quantization) parameters to server. by @viktike in #776
Fix mllama training and add more models in mapp by @Goekdeniz-Guelmez in #777
Add thinking budget and flag by @Blaizzy in #789
fix: add import guard for gradio in chat_ui entry point by @RtYkk in #787
Tool calling in server by @viktike in #773
Add Minicpm-o-2.5 by @Blaizzy in #791
Add prefill-step-size optional command line argument for server. by @viktike in #792
qwen3_omni_moe: fix visual_embeds_multiscale UnboundLocalError by @ronaldseoh in #794
Add Phi-4-reasoning-vision-15B (phi4-siglip) by @Blaizzy in #796
Add phi4mm by @Blaizzy in #797
Add ensure_fused_sdpa function for optimized attention computation by @Blaizzy in #800
[Internvl_chat] Add optional kwargs parameter to LanguageModel by @Blaizzy in #801
Guard load img & audio by @Blaizzy in #803
[Qwen3.5 MoE] Fix quant predicate by @Blaizzy in #804
Adding orpo by @Goekdeniz-Guelmez in #795
Add moondream3 by @Blaizzy in #807
Remove sleep duration with zero delay by @Blaizzy in #808
Update dependencies and version number by @Blaizzy in #809

New Contributors

@kol22 made their first contribution in #756
@haosenwang1018 made their first contribution in #765
@spicyneuron made their first contribution in #783
@viktike made their first contribution in #766
@yinzhidong made their first contribution in #788
@RtYkk made their first contribution in #787
@ronaldseoh made their first contribution in #794

Full Changelog: v0.3.12...v0.4.0