github docling-project/docling v2.33.0

3 months ago

Feature

  • Add textbox content extraction in msword_backend (#1538) (12a0e64)

Fix

  • Fix detection of docx files and of files with uppercase extensions (#1609) (f4d9d41)
  • Load_from_doctags static usage (#1617) (0e00a26)
  • Incorrect force_backend_text behaviour for VLM DocTag pipelines (#1371) (f2e9c07)
  • pypdfium: Resolve overlapping text when merging bounding boxes (#1549) (98b5eeb)
