Deprecated
Added
- Simple wrapper to easily extract text from a PDF file (#330)
- Support for extracting JBIG2 encoded images (#311 and #46)
- Sphinx documentation that is published on Read the Docs (#329)
Fixed
- Unhandled AssertionError when dumping pdf containing reference to object id 0 (#318)
- Debug flag actually changes logging level to debug for pdf2txt.py and dumppdf.py (#325)
Changed
- Using argparse instead of getopt for command line interface of dumppdf.py (#321)
- Refactor LTLayoutContainer.group_textboxes for a significant speed up in layout analysis (#315)
Removed
- Files for external applications such as django, cgi and pyinstaller (#314)