Blaizzy/mlx-vlm v0.3.10 on GitHub

What's Changed

Fix qk_norm for lighton by @Blaizzy in #615
Fix GLM-4.6V vision types by @Blaizzy in #616
Add support for rope parameters [GLM-4.6V MoE] by @Blaizzy in #617
Enhance chat UI by @ivanfioravanti in #619
Add support for TokenizersBackend by @Blaizzy in #621
UI by @ivanfioravanti in #620
docs: fix /chat/completion endpoint typo in README by @zenyr in #623
Add Support for HunyuanOCR (hunyuan_vl) by @voxmenthe in #604
Add Batch Generation by @Blaizzy in #538
Add autoflake (remove unused imports) by @Blaizzy in #628
Bump version to 0.3.10 by @Blaizzy in #632
Fix qwen3_vl_moe kwargs forwarding by @mattjcly in #633
Add support for Jina VLM by @hanxiao in #631
Fix: Skip index.json in file copy to prevent overwrite by @mzau in #638
Add Molmo2 support by @aliyovic in #639
Fix glm46v by @Blaizzy in #629
migrate qwen3omni to MLX by @hellopahe in #598
Fix audio loading by @Blaizzy in #642
Fix pixtral text-only regression from batch generation PR by @mzau in #644
Fix: fixes mlx-vlm.generate --chat when prompted with no imgages by @Deekshith-Dade in #648
Enhance load_config to include generation_config.json and extract eos… by @cubist38 in #650
Add LFM2.5-VL by @Blaizzy in #653
Add MXFP4 Quantization Support by @zhutao100 in #514
Add support for nvfp4 and mxfp8 by @Blaizzy in #657
Fix server audio, chat and trust remote code by @Blaizzy in #660
Fix tokenizer by @Blaizzy in #662
Pass kwargs through to snapshot_download in get_model_path by @Deekshith-Dade in #663
Revert "Pass kwargs through to snapshot_download in get_model_path" by @Blaizzy in #666
[Model] Add PaddleOCR-VL Model Support by @zhang-prog in #656
Remove _has_bytelevel_pretokenizer by @pcuenca in #668
Feat/structured outputs by @cubist38 in #664
Fix: Correct position embedding for LLM in PaddleOCR-VL by @zhang-prog in #672
Use mx.fast.scaled_dot_product_attention in pixtral vision by @pherber3 in #677
fix: Remove invalid parameter "None", explicitly specify assignment p… by @neil0306 in #687
Add text prefill and input embeddings obj by @Blaizzy in #681
Make sure weights are released during conversion by @pcuenca in #691
Add deepseek ocr 2 by @Blaizzy in #690
fix: prevent base64 image data from being tokenized as text by @ne2030 in #685
Add Ernie-4.5-VL by @Blaizzy in #627
TFMS v5 RC3 + Fix processor registry by @Blaizzy in #693

New Contributors

@voxmenthe made their first contribution in #604
@hanxiao made their first contribution in #631
@mzau made their first contribution in #638
@aliyovic made their first contribution in #639
@hellopahe made their first contribution in #598
@Deekshith-Dade made their first contribution in #648
@cubist38 made their first contribution in #650
@zhutao100 made their first contribution in #514
@zhang-prog made their first contribution in #656
@pherber3 made their first contribution in #677
@neil0306 made their first contribution in #687
@ne2030 made their first contribution in #685

Full Changelog: v0.3.9...v0.3.10