What's Changed
- Add dsv3 for lora by @awni in #284
- GPTQ quantization by @awni in #279
- Fix MoE fine tuning by @awni in #288
- fix hunyuan by @awni in #286
- Allow trust_remote_code in convert.py by @christian-lms in #289
- Type Signature Fixes by @MattBeton in #290
- Fix gemma3n config load bug by @neilmehta24 in #292
- kimi k2 by @awni in #293
- Add LFM2 by @Blaizzy in #291
- feat: DWQ for Hunyuan-A13B-Instruct and trust_remote_code argument by @ivanfioravanti in #303
- fix: update import for huggingface model in evaluate.py by @ivanfioravanti in #275
- Fix ddp workers loading the same data by @angeloskath in #294
- Fix server finish reason by @awni in #307
- Add support for SGD & Adafactor by @N8python in #306
- Allow empty prompt with input_embeddings by @will-lms in #308
- fix naive detokenizer by @awni in #312
- add exaone4 by @awni in #310
- add v1/models/repo_id by @awni in #313
- Update W&B logging crash in MLX-LM-LORA by @Goekdeniz-Guelmez in #316
- Fix DSV3 training by @awni in #324
- Lora works with cuda backend by @awni in #330
- Adding Muon Optimizer by @Goekdeniz-Guelmez in #325
New Contributors
- @christian-lms made their first contribution in #289
- @MattBeton made their first contribution in #290
- @N8python made their first contribution in #306
Full Changelog: v0.26.0...v0.26.1