What's Changed
-
2026/06/11 3.3 Released
This release focuses on Hybrid parsing performance optimization and VLM model capability upgrades. The main updates include:
-
New
effortparsing-strength parameter for the Hybrid backend- Added two parsing-strength levels,
mediumandhigh, allowing users to balance parsing speed, parsing accuracy, and feature requirements. - On OmniDocBench v1.6,
mediumreduces overall accuracy by only0.13points compared withhigh, while delivering35%~220%parsing speed improvements across different devices and scenarios:- Linux: about
80%faster for text PDF scenarios and about35%faster for OCR scenarios - Windows: about
90%faster for text PDF scenarios and about45%faster for OCR scenarios - macOS: about
220%faster for text PDF scenarios and about50%faster for OCR scenarios
- Linux: about
- The default Hybrid backend now uses
effort=medium, significantly improving overall parsing efficiency while maintaining high parsing accuracy. - The
mediumlevel does not supportimage analysis; for maximum parsing accuracy orimage analysissupport, switch to the high-strength parsing mode witheffort=high, which may have an impact on parsing speed.
- Added two parsing-strength levels,
-
VLM model upgraded to
MinerU2.5-Pro-2605-1.2B- Fixed multiple model issues found in the
2604version, further improving parsing stability on complex documents. - Added native multilingual OCR support, reducing the need for extra language-parameter configuration and improving out-of-the-box usability for multilingual documents.
- Fixed multiple model issues found in the
With the 3.3 release, MinerU further improves Hybrid backend efficiency across platforms and scenarios while maintaining high-accuracy parsing. The default
mediumeffort level is better suited for most day-to-day document processing tasks, whilehighis designed for scenarios that require maximum parsing accuracy orimage analysiscapabilities. -
-
2026/06/11 3.3 发布
本次版本更新聚焦于 Hybrid 解析性能优化 与 VLM 模型能力升级。主要更新内容包括:
-
Hybrid 后端新增
effort解析强度参数- 新增
medium与high两档解析强度,用户可根据解析速度、解析精度和功能需求灵活选择。 - 在 OmniDocBench v1.6 评测中,
medium相比high综合精度仅降低0.13,但在不同设备和场景下可获得35%~220%的解析速度提升:- Linux:文本 PDF 场景提升约
80%,OCR 场景提升约35% - Windows:文本 PDF 场景提升约
90%,OCR 场景提升约45% - macOS:文本 PDF 场景提升约
220%,OCR 场景提升约50%
- Linux:文本 PDF 场景提升约
- 默认 Hybrid 后端将使用
effort=medium,在保持高解析精度的同时显著提升整体解析效率。 medium档不支持image analysis(图片/图表分析)功能;如需极致解析精度或启用image analysis,可通过effort=high切换至高强度解析模式,但解析速度会受到一定影响。
- 新增
-
VLM 模型升级至
MinerU2.5-Pro-2605-1.2B- 修复
2604版本中存在的多处模型问题,进一步提升复杂文档场景下的解析稳定性。 - 原生支持多语言 OCR,降低多语言文档解析时对额外语言参数配置的依赖,提升跨语言场景的开箱即用体验。
- 修复
通过 3.3 版本,MinerU 在保持高精度解析能力的同时,进一步提升了 Hybrid 后端在多平台、多场景下的解析效率。默认
medium解析强度更适合大多数日常文档处理任务,而high模式则面向对解析精度和image analysis能力有更高要求的场景。 -
Full Changelog: mineru-3.2.3-released...mineru-3.3.1-released