Changed:
- 🔥
OcrdPageas proxy ofPcGtsTypeinstead of alias; also containsetreeandmappingnow - 🔥
Processor.zip_input_filesnow can throwocrd.NonUniqueInputFileandocrd.MissingInputFile
(the latter only ifOCRD_MISSING_INPUT=ABORT) - 🔥
Processor.zip_input_filesdoes not by default userequire_firstanymore
(so the first file in any input file tuple per page can beNoneas well) - 🔥 no more
Workspace.overwrite_mode, merely delegate toOCRD_EXISTING_OUTPUT=OVERWRITE - 🎨 improve on docs result for
ocrd_utils.config
Added:
- 👉
OCRD_DOWNLOAD_INPUTfor whether input files should be downloaded before processing - 👉
OCRD_MISSING_INPUTfor how to handle missing input files (SKIPorABORT) - 👉
OCRD_MISSING_OUTPUTfor how to handle processing failures (SKIPorABORTorCOPY)
the latter behaves like ocrd-dummy for the failed page(s) - 👉
OCRD_EXISTING_OUTPUTfor how to handle existing output files (SKIPorABORTorOVERWRITE) - new CLI option
--debugas short-hand forABORTchoices above Processor.loggerset up by constructor already (for re-use by processor implementors)default-expand and validateocrd_tool.jsoninProcessorconstructor, log invalidities- handle JSON
deprecationinocrd_tool.jsonby reporting warnings