EXO v1.0.71 Release Notes
This small patch release improves the default sampling settings, fixes bugs affecting M5 series Macs and RDMA, and adds support for the new Kimi K2.6.
Models
- Add Kimi K2.6, including support for multimodality (#1952)
- Add model cards for Qwen3.6 27B, GLM 5.1 and Kimi K2.6 quantisations (#1970)
Performance
- Add the sampling defaults recommended by model providers, including min_p and top_k, to improve output quality (#1947)
- Prevent JACCL all_sum from corrupting output (#1952)
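
For readers unfamiliar with the min_p and top_k settings mentioned above, here is a minimal sketch of what these two filters generally do to a next-token distribution. It is not EXO's implementation: the function name filter_probs and the example values are hypothetical and chosen for illustration only, and the defaults shipped in this release vary by model.

```python
import math


def filter_probs(logits, top_k=0, min_p=0.0):
    """Return renormalised token probabilities after top-k and min-p filtering.

    Hypothetical helper for illustration; not taken from the EXO codebase.
    """
    # Softmax, subtracting the largest logit for numerical stability.
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    probs = [e / total for e in exps]

    keep = set(range(len(probs)))

    # top_k: keep only the k most probable tokens (0 disables the filter).
    if top_k > 0:
        ranked = sorted(keep, key=lambda i: probs[i], reverse=True)
        keep = set(ranked[:top_k])

    # min_p: drop tokens whose probability is below min_p times the
    # probability of the most likely token (0.0 disables the filter).
    if min_p > 0.0:
        threshold = min_p * max(probs)
        keep = {i for i in keep if probs[i] >= threshold}

    # Renormalise so the surviving probabilities sum to 1.
    kept_mass = sum(probs[i] for i in keep)
    return [probs[i] / kept_mass if i in keep else 0.0 for i in range(len(probs))]


# Illustrative values only, not the defaults added in this release.
result = filter_probs([2.0, 1.0, 0.0, -3.0], top_k=3, min_p=0.2)
```

In this example top_k=3 removes the least likely token, and min_p=0.2 then removes the third token because its probability is less than one fifth of the top token's. The most likely token always survives min_p filtering, so the result is never empty.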
Quality of Life
- Add a Pi integration tab! (#1925)
- Handle multiple tool calls correctly in the Claude and Responses APIs (#1952)
- Add a friendly context message for bug reporting (#1959)
Bug Fixes
- Fix vision models on M5 series Macs by fixing addmm on NAX (#1927)
- Fix prefix cache for Qwen3.6 (#1952)
- Prevent complete downloads from losing their Completed status (#1956, #1958)
- Populate the image cache from events to prevent crashes (#1961)
- Fix DeepSeekV32 parsing and tensor parallelism for Gemma 4 models with effective parameters (#1967)
New Contributors
Thank you to @nhwaani, who made their first contribution in this release!
Full Changelog: v1.0.70...v1.0.71