Added
- --urlFile browsertrix crawler parameter
- --depth browsertrix crawler parameter
- --extraHops parameter
- --collection browsertrix crawler parameter
- --allowHashUrls browsertrix crawler parameter
- --userAgentSuffix browsertrix crawler parameter
- --behaviors parameter
- --behaviorTimeout browsertrix crawler parameter
- --profile browsertrix crawler parameter
- --sizeLimit browsertrix crawler parameter
- --timeLimit browsertrix crawler parameter
- --healthCheckPort parameter
- --overwrite parameter
Changed
- using browsertrix-crawler 0.6.0 and warc2zim 1.4.2
- default WARC location after crawl changed from collections/capture-*/archive/ to collections/crawl-*/archive/
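
Scripts that pick up WARC files after a crawl may need to handle both locations. A minimal sketch, assuming the output directory is known to the caller and that WARC files end in .warc or .warc.gz; the helper name find_warc_files is hypothetical and not part of this tool:

```python
from pathlib import Path

# The two layouts named in the entry above: old first, then new.
ARCHIVE_PATTERNS = (
    "collections/capture-*/archive",
    "collections/crawl-*/archive",
)


def find_warc_files(output_dir):
    """Return WARC files found under either archive layout, sorted by path."""
    root = Path(output_dir)
    found = []
    for pattern in ARCHIVE_PATTERNS:
        for archive_dir in root.glob(pattern):
            if archive_dir.is_dir():
                # Assumption: crawl output uses .warc or .warc.gz file names.
                found.extend(archive_dir.glob("*.warc"))
                found.extend(archive_dir.glob("*.warc.gz"))
    return sorted(found)
```

Calling find_warc_files on a crawl output directory returns an empty list when neither layout is present, so callers can treat that as "no crawl data found".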
Removed
- --scroll browsertrix crawler parameter (see --behaviors)
- --scope browsertrix crawler parameter (see --scopeType, --include and --exclude)