github crwlrsoft/crawler v0.3.0

pre-release, 2 years ago

Added

  • By calling monitorMemoryUsage() you can tell the Crawler to add a log message with the current memory usage after every step invocation. You can also set a limit in bytes at which monitoring starts; below that limit, memory usage is not logged.
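The threshold behaviour described above can be illustrated with a small, language-agnostic sketch in Python. This is not the library's PHP implementation: `memory_message`, `run_steps`, and the message format are hypothetical names chosen for illustration, and `tracemalloc` merely stands in for whatever memory measurement the crawler uses.

```python
import tracemalloc
from typing import Callable, Iterable, Optional


def memory_message(used_bytes: int, limit: Optional[int] = None) -> Optional[str]:
    """Return a log line, or None when usage is below the optional limit."""
    if limit is not None and used_bytes < limit:
        return None  # below the limit: stay silent
    return f"memory usage: {used_bytes} bytes"


def run_steps(
    steps: Iterable[Callable[[int], int]],
    value: int,
    limit: Optional[int] = None,
    log: Callable[[str], None] = print,
) -> int:
    """Invoke each (hypothetical) step in turn and check memory after every call."""
    tracemalloc.start()
    try:
        for step in steps:
            value = step(value)
            used, _peak = tracemalloc.get_traced_memory()
            message = memory_message(used, limit)
            if message is not None:
                log(message)
    finally:
        tracemalloc.stop()
    return value
```

Without a limit, every step invocation produces one log line; with a very high limit, the run stays silent.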

Fixed

  • Previously, the use of Generators didn't make a lot of sense, because the outputs of one step were only iterated and passed on to the next step after the current step had been invoked with all its inputs. That made steps with a lot of inputs bottlenecks and caused higher memory consumption. The crawler now passes each output of a step on to the next step immediately, if there is one.
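The difference between the two behaviours can be sketched with plain Python generators. This is a conceptual illustration only, not the crawler's PHP code: `make_step`, `run_eager`, and `run_lazy` are hypothetical names, and each step here simply multiplies its input by ten.

```python
from typing import Callable, Iterable, Iterator, List

Step = Callable[[int], Iterator[int]]


def make_step(name: str, log: List[str]) -> Step:
    """Build a toy step that records each invocation and yields one output."""
    def step(value: int) -> Iterator[int]:
        log.append(f"{name}({value})")
        yield value * 10
    return step


def run_eager(steps: Iterable[Step], inputs: Iterable[int]) -> List[int]:
    """Old behaviour: finish a step for ALL inputs before starting the next."""
    current = list(inputs)
    for step in steps:
        outputs: List[int] = []
        for item in current:
            outputs.extend(step(item))  # every output is buffered in memory
        current = outputs
    return current


def run_lazy(steps: Iterable[Step], inputs: Iterable[int]) -> Iterator[int]:
    """New behaviour: hand each output to the next step as soon as it exists."""
    def chain(step: Step, upstream: Iterator[int]) -> Iterator[int]:
        for item in upstream:
            yield from step(item)

    stream: Iterator[int] = iter(inputs)
    for step in steps:
        stream = chain(step, stream)
    return stream
```

With two steps `a` and `b` and inputs `[1, 2]`, the eager version invokes `a` for both inputs before `b` runs at all, while the lazy version interleaves them (`a(1)`, `b(10)`, `a(2)`, `b(20)`) and never holds more than one intermediate output at a time.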
