github opendatalab/MinerU mineru-3.1.0-released

4 hours ago

What's Changed

  • 2026/04/18 3.1.0 Released

    This release focuses on licensing openness, parsing accuracy, and full-format native support. The main updates include:

    • License upgrade
      • MinerU has officially moved from AGPLv3 to the MinerU Open Source License, a custom license based on Apache 2.0.
      • This change significantly reduces adoption friction for both community users and commercial deployments, making MinerU easier to integrate into real-world workflows.
    • VLM main model upgrade
      • The primary VLM model has been upgraded to MinerU2.5-Pro-2604-1.2B, bringing overall parsing accuracy to a state-of-the-art level.
      • The new model now supports image and chart parsing, truncated paragraph merging, cross-page table merging, and image recognition inside tables, further strengthening performance on complex document layouts.
    • Full-format native parsing support
      • Native parsing support has now been extended to PPTX and XLSX.
      • MinerU now fully supports parsing across images, PDF, DOCX, PPTX, and XLSX, providing a more complete multi-format document understanding workflow.
  • 2026/04/18 3.1.0 发布

    本次版本更新聚焦于许可协议开放性、解析精度提升与全格式原生支持。主要更新内容包括:

    • 许可协议升级
      • MinerU 已正式从 AGPLv3 切换至基于 Apache 2.0MinerU 开源许可证
      • 新的许可方式在兼顾开源协作与商业落地的同时,进一步降低了社区使用和商业化接入门槛,让 MinerU 更容易融入真实业务流程。
    • VLM 主模型升级
      • VLM 主模型正式切换为 MinerU2.5-Pro-2604-1.2B,整体解析精度提升至业内领先水平。
      • 新模型现已支持子图切分合并、图像与图表解析、截断段落合并、跨页面表格合并以及表格内图像识别,复杂版面场景下的解析能力进一步增强。
    • 全格式原生解析支持
      • 新增 PPTXXLSX 原生解析能力。
      • 至此,MinerU 已完整支持图片、PDFDOCXPPTXXLSX 全格式解析,为多类型文档统一处理提供了更完整的能力闭环。

Full Changelog: mineru-3.0.9-released...mineru-3.1.0-released

Don't miss a new MinerU release

NewReleases is sending notifications on new releases.