Dry Run Mode

Question

ikreymer opened this issue 2 months ago

A 'dry run' mode (is that the best name?) would run a crawl without storing any archive data. It could be used to examine the scope of a crawl via logs and saved state, or, combined with external proxies (see #587), to delegate handling to a remote proxy. Dry run mode should still fetch everything and run behaviors, but not write any local data.
Text extraction and screenshots should also be skipped.
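
To make the proposed behavior concrete, here is a rough sketch of the control flow, not the crawler's actual code. Every name in it (`crawl`, `CrawlResult`, the `dry_run` flag, the `fetch` callback) is hypothetical and only illustrates the idea: fetching and behaviors always run and are logged, while archive writes, text extraction, and screenshots are skipped when dry run is enabled.

```python
from dataclasses import dataclass, field
from typing import Callable, Iterable, List


@dataclass
class CrawlResult:
    """Hypothetical container for what a crawl leaves behind."""
    log: List[str] = field(default_factory=list)          # always populated
    archived: List[str] = field(default_factory=list)     # stand-in for archive data
    extracted_text: List[str] = field(default_factory=list)
    screenshots: List[str] = field(default_factory=list)


def crawl(urls: Iterable[str],
          fetch: Callable[[str], str],
          dry_run: bool = False) -> CrawlResult:
    """Visit each URL; in dry run mode, skip everything that stores data."""
    result = CrawlResult()
    for url in urls:
        # Fetching and behaviors happen in both modes, so the logs
        # still show the full scope of the crawl.
        body = fetch(url)
        result.log.append(f"fetched {url} ({len(body)} bytes)")
        result.log.append(f"ran behaviors on {url}")

        if dry_run:
            # Assumption: nothing below this point runs in a dry run.
            continue

        result.archived.append(url)                # placeholder for an archive write
        result.extracted_text.append(body)         # placeholder for text extraction
        result.screenshots.append(f"{url}.png")    # placeholder for a screenshot
    return result


def fake_fetch(url: str) -> str:
    """Stand-in for a real page load, so the sketch runs offline."""
    return f"<html>{url}</html>"


if __name__ == "__main__":
    dry = crawl(["https://example.com/"], fake_fetch, dry_run=True)
    print(dry.log)
    print(dry.archived, dry.screenshots)
```

In this sketch the logs are identical in both modes, which is what would let a dry run be used to check crawl scope before committing to a full crawl.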