github crwlrsoft/crawler v1.1.5

latest releases: v1.7.2, v1.7.1, v1.7.0...
10 months ago

Fixed

  • The Http::crawl() step, as well as the Html::getLink() and Html::getLinks() steps now ignore links, when the href attribute starts with mailto:, tel: or javascript:. For the crawl step it obviously makes no sense, but it's also considered a bugfix for the getLink(s) steps, because they are meant to deliver absolute HTTP URLs. If you want to get the values of such links, use the HTML data extraction step.

Don't miss a new crawler release

NewReleases is sending notifications on new releases.