New in this release:
- use PyMuPDF to read the ToC from the PDF (if one exists in the PDF metadata)
- correct header levels and hierarchy based on the ToC
- best-effort attempt to:
  - convert text and list items to headers if they were parsed incorrectly and appear in the ToC
  - convert headers to text items if they were parsed incorrectly and do not appear in the ToC
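
The behaviour listed above can be sketched roughly as follows. This is an illustration under stated assumptions, not the code shipped in this release: `Item`, `read_toc`, and `apply_toc` are hypothetical names, and matching items to ToC entries by exact whitespace- and case-normalised title is an assumed strategy. The only PyMuPDF calls used are `open()` and `Document.get_toc()`, which returns entries of the form `[level, title, page]`.

```python
from dataclasses import dataclass


def read_toc(pdf_path):
    """Return the PDF outline as [level, title, page] entries ([] if none)."""
    try:
        import pymupdf  # module name in recent PyMuPDF releases
    except ImportError:
        import fitz as pymupdf  # module name in older PyMuPDF releases
    doc = pymupdf.open(pdf_path)
    try:
        return doc.get_toc()
    finally:
        doc.close()


@dataclass
class Item:
    """Hypothetical parsed document item (not a type from the release)."""
    kind: str       # "header", "text", or "list_item"
    text: str
    level: int = 0  # header level; 0 for non-headers


def _norm(s):
    # Collapse whitespace and ignore case so minor differences still match.
    return " ".join(s.split()).casefold()


def apply_toc(items, toc):
    """Best-effort reconciliation of parsed items against the ToC."""
    if not toc:
        return items  # no ToC in the PDF: leave the parse untouched
    levels = {}
    for level, title, *_ in toc:
        levels.setdefault(_norm(title), level)  # first occurrence wins
    result = []
    for item in items:
        level = levels.get(_norm(item.text))
        if level is not None:
            # In the ToC: make it a header at the ToC's level.
            result.append(Item("header", item.text, level))
        elif item.kind == "header":
            # Parsed as a header but absent from the ToC: demote to text.
            result.append(Item("text", item.text, 0))
        else:
            result.append(item)
    return result
```

With a ToC of `[[1, "Introduction", 1], [2, "Background", 2]]`, a text item "Introduction" would be promoted to a level-1 header, while a header "Figure 3" that is missing from the ToC would be demoted to text. Exact-title matching is deliberately simple here; a real parse may need fuzzier matching to cope with numbering prefixes or line breaks in titles.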