github opendataloader-project/opendataloader-pdf v2.4.7
Release v2.4.7

3 hours ago

What's Changed

  • feat(hybrid): persist full-page DLA renders for evidence overlays by @bundolee in #529
  • feat(json): emit per-node ai_score + pdfua_tag, fix paragraph/table metadata holes by @bundolee in #530
  • feat(hybrid): expose DLA raw object_id on ElementMetadata for downstream consumers by @bundolee in #532
  • Update HTML output with layout attributes by @LonelyMidoriya in #524
  • Add font-size property to formatted html by @LonelyMidoriya in #533
  • Improve StrikethroughProcessor by @MaximPlusov in #534
  • Auto-tagging - Improve location of structure elements for annots by @MaximPlusov in #520
  • fix(auto-tagging): assign unique /ID to Note / FENote struct elements (PDF/UA-1 §7.9.1) by @bundolee in #535
  • fix(hybrid/docling): collapse spanning cells so docling tables pass PDF/UA §7.2 by @bundolee in #536

Full Changelog: v2.4.5...v2.4.7

Don't miss a new opendataloader-pdf release

NewReleases is sending notifications on new releases.