On rare occasions, page rendering and text extraction caused interpreter segmentation faults.
When fonts on a page (illegally) were meeting all of the following conditions:
- a non-UTF8 encodable name
- not embedded in the PDF
- using
Identity-H / Identity-V
ancoding
Then MuPDF reacted with appropriate warning messages (which are intercepted by PyMuPDF).
In prior versions, PyMuPDF did not expect non-UTF8 messages and used a too "optimistic" char to unicode conversion method, which brought down the interpreter.
This has changed in the new version.