github opendatalab/MinerU mineru-3.3.1-released

What's Changed

  • 2026/06/11 3.3 Released

    This release focuses on optimizing Hybrid parsing performance and upgrading the VLM model's capabilities. The main updates include:

    • New effort parsing-strength parameter for the Hybrid backend

      • Added two parsing-strength levels, medium and high, allowing users to balance parsing speed, parsing accuracy, and feature requirements.
      • On OmniDocBench v1.6, medium reduces overall accuracy by only 0.13 points compared with high, while improving parsing speed by 35% to 220%, depending on device and scenario:
        • Linux: about 80% faster for text PDF scenarios and about 35% faster for OCR scenarios
        • Windows: about 90% faster for text PDF scenarios and about 45% faster for OCR scenarios
        • macOS: about 220% faster for text PDF scenarios and about 50% faster for OCR scenarios
      • The default Hybrid backend now uses effort=medium, significantly improving overall parsing efficiency while maintaining high parsing accuracy.
      • The medium level does not support image analysis (analysis of images and charts). For maximum parsing accuracy or image analysis support, set effort=high to switch to high-strength parsing, at some cost in parsing speed.
    • VLM model upgraded to MinerU2.5-Pro-2605-1.2B

      • Fixed multiple model issues found in the 2604 version, further improving parsing stability on complex documents.
      • Added native multilingual OCR support, reducing the need for extra language-parameter configuration and improving out-of-the-box usability for multilingual documents.

    With the 3.3 release, MinerU further improves Hybrid backend efficiency across platforms and scenarios while maintaining high-accuracy parsing. The default medium effort level is better suited for most day-to-day document processing tasks, while high is designed for scenarios that require maximum parsing accuracy or image analysis capabilities.
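
The trade-off described above can be sketched in a few lines of Python. This is only an illustration: `choose_effort`, `relative_time`, and `SPEEDUP_PERCENT` are hypothetical names, not part of the MinerU API, and the notes do not say how the effort value is actually passed to MinerU. The sketch also assumes that "N% faster" means throughput is multiplied by (1 + N/100), which the release notes do not state explicitly.

```python
# Illustrative sketch only: these names are hypothetical, not MinerU APIs.
# Figures are the approximate medium-vs-high speedups quoted in the 3.3 notes.
SPEEDUP_PERCENT = {
    ("linux", "text"): 80,
    ("linux", "ocr"): 35,
    ("windows", "text"): 90,
    ("windows", "ocr"): 45,
    ("macos", "text"): 220,
    ("macos", "ocr"): 50,
}


def choose_effort(need_image_analysis: bool, need_max_accuracy: bool = False) -> str:
    """Return "high" when image analysis or maximum accuracy is needed, else "medium"."""
    return "high" if (need_image_analysis or need_max_accuracy) else "medium"


def relative_time(platform: str, scenario: str) -> float:
    """Estimate medium's run time as a fraction of high's run time.

    Assumption: "N% faster" means throughput is multiplied by (1 + N/100).
    """
    percent = SPEEDUP_PERCENT[(platform, scenario)]
    return 1.0 / (1.0 + percent / 100.0)


if __name__ == "__main__":
    print("effort without image analysis:", choose_effort(need_image_analysis=False))
    for (platform, scenario), percent in SPEEDUP_PERCENT.items():
        fraction = relative_time(platform, scenario)
        print(f"{platform:8s} {scenario:5s} +{percent}% -> {fraction:.2f}x the run time of high")
```

Under that reading, the smallest quoted gain (35%, Linux OCR) still cuts run time by roughly a quarter, and the largest (220%, macOS text PDF) cuts it by more than two thirds, so high is mainly worth choosing when image analysis or the last 0.13 points of accuracy matter.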

Full Changelog: mineru-3.2.3-released...mineru-3.3.1-released
