Improved
- Add profiler for performance debugging
- Performance optimizations for math and content patterns (#212)
- Footnotes: Alternate aside style, inline improvements, false positive fixes, loose footnotes, and HTML named anchor footnotes
- Code blocks: More syntax highlighting patterns, Chroma and CodeMirror support
- Improve unique author filtering and deduplication
- Extract authors and dates from cover elements
- YouTube: Add timeouts, fallbacks, fix stale metadata after SPA navigation (#174)
- YouTube Shorts handling (#206)
- YouTube: Respect preferred transcript language (#202)
- Reddit: Remove duplication, fix author extraction when comments haven't loaded (#204)
- Tailwind: Improve patterns for footnotes, metadata, and removals
- Content pattern removals for newsletters, related posts, breadcrumbs
- Extract BBcode formatting
- Substack extractor (#216)
- Honor proxy settings (#165)
Fixes
- Fix og:title brand name used as article title (#196)
- Fix MathJax SVG / MathML-only math rendering (#201)
- Fix main content embedded into figure elements
- Fix flex-row line gutters and invalid
code>prenesting in code blocks - Fix buttons appearing in code blocks
- Fix X author fallback on status URLs (#208)
- Fix charset parsing for quoted and trailing-comma values
- Fix proper description metadata extraction (#198)
- Fix bad author strings from broken CMS templates (#207)
- Preserve text when footnote reference is wrapped around reference