github apify/crawlee-python v0.0.5
0.0.5

latest releases: v1.0.0rc1, v0.6.12, v0.6.11...
14 months ago

Adds

  • Add explicit error messages for missing package extras during import
  • Better browser abstraction:
    • BrowserController - Wraps a single browser instance and maintains its state.
    • BrowserPlugin - Manages the browser automation framework, and basically acts as a factory for controllers.
  • Browser rotation with a maximum number of pages opened per browser.
  • Add emit persist state event to event manager
  • Add batched request addition in RequestQueue
  • Add start requests option to BasicCrawler
  • Add storage-related helpers get_data, push_data and export_to to BasicCrawler and BasicContext
  • Add PlaywrightCrawler's enqueue links helper

Fixes

  • Fix type error in persist state of statistics

Don't miss a new crawlee-python release

NewReleases is sending notifications on new releases.