- De-vendorize Readability, use the freshly-minted `@mozilla/readability` npm package instead (thanks @gijsk!)
- Sanitize the metadata extracted with Readability with DOMPurify
- Make slugify() stricter about which characters to leave in the file name
- Configure Puppeteer to produce tagged PDF files (See #47)
- Accept the `file://` protocol, and absolute/relative paths to files on disk (See #34)
- Make JSON-LD extraction handle more cases
- Log, but don't break on, invalid `srcset` attributes