github pdfminer/pdfminer.six 20221105

latest release: 20231228
18 months ago

Added

  • Output converter for the hOCR format (#651)
  • Font name aliases for Arial, Courier New and Times New Roman (#790)
  • Documentation on why special characters can sometimes not be extracted (#829)

Fixed

  • ValueError when bmp images with 1 bit channel are decoded (#773)
  • ValueError when trying to decrypt empty metadata values (#766)
  • Sphinx errors during building of documentation (#760)
  • TypeError when getting default width of font (#720)
  • Installing typing-extensions on Python 3.6 and 3.7 (#775)
  • TypeError in cmapdb.py when parsing null characters (#768)
  • Color "convenience operators" now (per spec) also set color space (#794)
  • ValueError when extracting images, due to breaking changes in Pillow (#827)
  • Small typo's and issues in the documentation (#828)

Deprecated

  • Usage of if __name__ == "__main__" where it was only intended for testing purposes (#756)

Don't miss a new pdfminer.six release

NewReleases is sending notifications on new releases.