"Workflow Greatness" - Automated Document Processing Pipeline 🔄
New Features 🚀
- OCR Completion Tags
- Automatically tag documents after OCR processing is complete using
PDF_OCR_COMPLETE_TAG
- Enable workflow automations based on OCR completion status
- Works seamlessly with all OCR providers
- Automatically tag documents after OCR processing is complete using
Quick Setup Guide 🛠️
environment:
# For automatic OCR processing
AUTO_OCR_TAG: "paperless-gpt-ocr-auto" # Tag documents to trigger automatic OCR
# Enable OCR completion tagging
PDF_OCR_TAGGING: "true"
PDF_OCR_COMPLETE_TAG: "paperless-gpt-ocr-complete" # Tag added after OCR completion
How It Works 📋
- Tag a document with
paperless-gpt-ocr-auto
to queue it for OCR processing - Document is processed using your configured OCR provider
- Upon completion, the document is automatically tagged with
paperless-gpt-ocr-complete
- Use these tags to trigger further automations or workflows
Technical Improvements ⚙️
- Enhanced test coverage for OCR workflow automation
- Improved document processor architecture
- Optimized test execution with parallel workers
What's Changed
- Fix OCR completion tag and adds OCR document processing E2E test by @icereed in #379
- fix(deps): update module gorm.io/gorm to v1.26.0 by @renovate in #377
- chore(deps): update dependency vite to v6.3.5 by @renovate in #381
- chore(deps): update dependency @types/react-dom to v19.1.3 by @renovate in #365
- chore(deps): update dependency typescript-eslint to v8.31.1 by @renovate in #337
- fix(deps): update module cloud.google.com/go/documentai to v1.37.0 by @renovate in #373
- chore(deps): update eslint monorepo to v9.26.0 by @renovate in #332
- chore(deps): update dependency eslint-plugin-react-refresh to v0.4.20 by @renovate in #349
- fix(deps): update github.com/gardar/ocrchestra digest to 5fe7a0d by @renovate in #376
- fix(deps): update module golang.org/x/sync to v0.14.0 by @renovate in #382
Full Changelog: v0.17.0...v0.17.1