What's Changed
- Fix qk_norm for lighton by @Blaizzy in #615
- Fix GLM-4.6V vision types by @Blaizzy in #616
- Add support for rope parameters [GLM-4.6V MoE] by @Blaizzy in #617
- Enhance chat UI by @ivanfioravanti in #619
- Add support for TokenizersBackend by @Blaizzy in #621
- UI by @ivanfioravanti in #620
- docs: fix /chat/completion endpoint typo in README by @zenyr in #623
- Add Support for HunyuanOCR (hunyuan_vl) by @voxmenthe in #604
- Add Batch Generation by @Blaizzy in #538
- Add autoflake (remove unused imports) by @Blaizzy in #628
- Bump version to 0.3.10 by @Blaizzy in #632
- Fix qwen3_vl_moe kwargs forwarding by @mattjcly in #633
- Add support for Jina VLM by @hanxiao in #631
- Fix: Skip index.json in file copy to prevent overwrite by @mzau in #638
- Add Molmo2 support by @aliyovic in #639
- Fix glm46v by @Blaizzy in #629
- Migrate qwen3omni to MLX by @hellopahe in #598
- Fix audio loading by @Blaizzy in #642
- Fix pixtral text-only regression from batch generation PR by @mzau in #644
- Fix: fixes mlx-vlm.generate --chat when prompted with no images by @Deekshith-Dade in #648
- Enhance load_config to include generation_config.json and extract eos… by @cubist38 in #650
- Add LFM2.5-VL by @Blaizzy in #653
- Add MXFP4 Quantization Support by @zhutao100 in #514
- Add support for nvfp4 and mxfp8 by @Blaizzy in #657
- Fix server audio, chat and trust remote code by @Blaizzy in #660
- Fix tokenizer by @Blaizzy in #662
- Pass kwargs through to snapshot_download in get_model_path by @Deekshith-Dade in #663
- Revert "Pass kwargs through to snapshot_download in get_model_path" by @Blaizzy in #666
- [Model] Add PaddleOCR-VL Model Support by @zhang-prog in #656
- Remove _has_bytelevel_pretokenizer by @pcuenca in #668
- Feat/structured outputs by @cubist38 in #664
- Fix: Correct position embedding for LLM in PaddleOCR-VL by @zhang-prog in #672
- Use mx.fast.scaled_dot_product_attention in pixtral vision by @pherber3 in #677
- fix: Remove invalid parameter "None", explicitly specify assignment p… by @neil0306 in #687
- Add text prefill and input embeddings obj by @Blaizzy in #681
- Make sure weights are released during conversion by @pcuenca in #691
- Add deepseek ocr 2 by @Blaizzy in #690
- fix: prevent base64 image data from being tokenized as text by @ne2030 in #685
- Add Ernie-4.5-VL by @Blaizzy in #627
- TFMS v5 RC3 + Fix processor registry by @Blaizzy in #693
New Contributors
- @voxmenthe made their first contribution in #604
- @hanxiao made their first contribution in #631
- @mzau made their first contribution in #638
- @aliyovic made their first contribution in #639
- @hellopahe made their first contribution in #598
- @Deekshith-Dade made their first contribution in #648
- @cubist38 made their first contribution in #650
- @zhutao100 made their first contribution in #514
- @zhang-prog made their first contribution in #656
- @pherber3 made their first contribution in #677
- @neil0306 made their first contribution in #687
- @ne2030 made their first contribution in #685
Full Changelog: v0.3.9...v0.3.10