github watercrawl/WaterCrawl v0.2.0
Release v0.2.0

latest releases: v0.10.2, v0.10.1, v0.10.0...
7 months ago

[0.2.0] - 2025-01-15

Added

  • Integrated Playwright for dynamic page rendering and JavaScript execution
  • Support for PDF and Screenshot attachments for crawl results
  • Advanced page interaction options (wait time, cookie acceptance, locale settings)
  • Improved Docker build process with multi-platform support
  • Added API version endpoint
  • Extended crawler options with timeout, cookies, locale, and headers support
  • Duration tracking for crawl requests
  • Support for longer URLs (up to 2048 characters)

Changed

  • Enhanced page rendering with Playwright middleware
  • Improved JavaScript handling and dynamic content extraction
  • Enhanced Docker workflow with better caching and versioning
  • Improved domain handling in spider options
  • Updated concurrent request settings
  • Better organization of crawler constants and types

Infrastructure

  • Added multi-platform Docker builds (linux/amd64, linux/arm64)
  • Improved Docker caching and build optimization
  • Added version tracking in Docker builds

Don't miss a new WaterCrawl release

NewReleases is sending notifications on new releases.