papra-hq/papra @papra/lecture@0.5.0 on GitHub

#953 db6badb Thanks @CorentinTh! - Added content extraction support for scanned PDFs images in 1-bit-per-pixel grayscale format.
#948 725eaff Thanks @CorentinTh! - When extracting text from PDF documents, if neither text nor images suitable for OCR are found, the pages are rendered as images and processed with OCR. Adding support for vectorized text.

#949 ec740ed Thanks @CorentinTh! - Added document content extraction support for .xlsx and .ods files.