github jorisschellekens/borb v2.1.7

latest releases: v2.1.25, v2.1.24, v2.1.23...
23 months ago

📣 borb release notes

This release is a feature release.

  • Introducing SimpleFindReplace which enables you to find and replace text in a PDF
    • For examples on how to use it, check the examples repository
    • You can also check the tests directory in this project

This release fixes inconsistencies in toolkit

Most of the classes in the following table implement EventListener and are part of the package toolkit.
They have a class method (which you can call if you instantiate them and add them as EventListener to a PDF).
They also have a static method that you can call. The class method and static method typically return the same type/thing.

The static method has the advantage that it allows you to work with a Document, whereas the class method only works with a PDF that is being loaded.
Or, to put it simply, the static method can be used at any point in the life-cycle of Document, whereas the class method can only be used when reading an existing PDF.

This table gives you an overview of the available classes in toolkit and their methods:

class class method static method status
ColorExtraction get_color get_color_from_pdf ✔️
FontExtraction get_fonts
FontExtraction get_font_names
HTMLToPDF convert_html_to_layout_element ✔️
HTMLToPDF convert_html_to_pdf ✔️
ImageExtraction get_images get_images_from_pdf ✔️
MarkdownToPDF convert_markdown_to_layout_element ✔️
MarkdownToPDF convert_markdown_to_pdf ✔️
PDFToJPG convert_to_jpg convert_pdf_to_jpg ✔️
PDFToMP3 convert_to_mp3 convert_pdf_to_mp3 ✔️
PDFToSVG convert_to_svg convert_pdf_to_svg ✔️
RegularExpressionTextExtraction get_matches get_matches_for_pdf ✔️
SimpleLineOfTextExtraction get_lines_of_text get_lines_of_text_from_pdf ✔️
SimpleNonLigatureTextExtraction get_text get_text_from_pdf ✔️
SimpleParagraphExtraction get_paragraphs get_paragraphs_from_pdf ✔️
SimpleTextExtraction get_text get_text_from_pdf ✔️
TableDetectionByLines get_tables
TableDetectionByLines get_table_bounding_boxes
TextRankKeywordExtraction get_keywords get_keywords_from_pdf ✔️
TFIDFKeywordExtraction get_keywords get_keywords_from_pdf ✔️

Don't miss a new borb release

NewReleases is sending notifications on new releases.