github opendatalab/MinerU mineru-3.4.0-released

8 hours ago

What's Changed

  • 2026/06/18 3.4 Released

    This release focuses on OCR capability upgrades for the pipeline backend, OCR processing pipeline optimization, and model download experience improvements. The main updates include:

    • OCR model upgrade and processing acceleration

      • The OCR model for the pipeline backend has been upgraded to PP-OCRv6, improving OCR accuracy by about 11% on OmniDocBench v1.6.
      • Removed Japanese, Traditional Chinese, English, and Latin options from OCR language selection. These scenarios are now routed to the ch OCR model, simplifying model configuration and language selection.
      • Optimized the OCR inference and processing pipeline, increasing OCR processing speed by about 100% and significantly improving parsing efficiency for batch documents and OCR-intensive documents.
    • Model download logic optimization

      • Added automatic model source selection, allowing first-time installations to choose a better model source based on the current network environment.
      • Before downloading models, MinerU now prioritizes checking locally downloaded model cache files. Cache hits can be reused directly, reducing repeated downloads and unnecessary remote requests.
      • For more details about model source configuration, automatic source selection, and local model usage, see the Model Source Documentation.

    With the 3.4 release, MinerU further improves the parsing accuracy and processing efficiency of the pipeline backend in OCR scenarios. It also optimizes model downloads, cache reuse, and local configuration write-back, making first-time installation, model updates, and multi-environment deployment more stable and automated.

  • 2026/06/18 3.4 发布

    本次版本更新聚焦于 pipeline 后端 OCR 能力升级OCR 处理链路优化模型下载体验改进。主要更新内容包括:

    • OCR 模型升级与处理加速

      • pipeline 后端 OCR 模型更新至 PP-OCRv6,在 OmniDocBench v1.6 评测中,OCR 相关指标提升约 11%
      • 移除 OCR 语言选择中的日语、繁体中文、英语、拉丁文选项,相关场景统一路由到 ch OCR 模型,简化模型配置与语言选择逻辑。
      • 优化 OCR 推理与处理链路,OCR 处理速度提升约 100%,显著改善批量文档和 OCR 密集型文档的解析效率。
    • 模型下载逻辑优化

      • 新增模型源自动选择能力,首次安装时可根据当前网络环境自动选择更合适的模型源。
      • 下载模型前会优先检查本地已下载的模型缓存文件,命中缓存时可直接复用,减少重复下载和不必要的远端请求。
      • 更多模型源配置、自动选择策略与本地模型使用说明,请参考 模型源说明

    在 3.4 版本,MinerU 进一步提升了 pipeline 后端在 OCR 场景下的解析精度与处理效率,同时优化了模型下载、缓存复用和本地配置写入流程,让首次安装、模型更新和多环境部署更加稳定、自动化。

Full Changelog: mineru-3.3.1-released...mineru-3.4.0-released

Don't miss a new MinerU release

NewReleases is sending notifications on new releases.