github datalab-to/marker v1.6.0
Support word, powerpoint, excel, html, epub + math improvements

15 months ago

Support xlsx, docx, pptx, html, epub

Marker now supports additional document formats: xlsx, docx, pptx, html, and epub. Run pip install marker-pdf[full] to install all the dependencies.
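As a rough illustration of when the extra install matters, the sketch below checks a file's extension against the formats named in this release and suggests a pip command. The helper names (needs_full_extra, install_hint) are hypothetical and not part of Marker. It also assumes that only the newly added formats require the full extra.

```python
from pathlib import Path

# Extensions named in this release; PDF was already supported before it.
# Assumption: these are the formats that need the "full" extra.
FULL_EXTRA_EXTENSIONS = {".xlsx", ".docx", ".pptx", ".html", ".epub"}


def needs_full_extra(path: str) -> bool:
    """Return True if the file's extension is one of the newly added formats."""
    return Path(path).suffix.lower() in FULL_EXTRA_EXTENSIONS


def install_hint(path: str) -> str:
    """Suggest a pip command for the given input file (hypothetical helper)."""
    if needs_full_extra(path):
        # Quoting keeps shells such as zsh from expanding the brackets.
        return 'pip install "marker-pdf[full]"'
    return "pip install marker-pdf"
```

For example, install_hint("slides.pptx") suggests the full extra, while install_hint("paper.pdf") suggests the base package.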

Improved text detection

OCR should now work better due to an improved text detection model.

Inline math improvements

  • Better inline math detection with an improved model.
  • Inline math lines are now inference.
  • New --redo-inline-math option that enables the highest-quality math detection.

Misc improvements

  • Support for Claude models
  • Improve benchmarking scripts
  • Merge lines better with new text detection model

Full Changelog: v1.5.5...v1.6.0
