How to parse pages with HTTP errors (403, 404)
n-studio opened this issue · comments
I'm using mechanize
.
Some pages with errors (403, 404...) still display valuable information, but the scraper will just retry or skip the page. Is there a way to treat error pages just like 200 pages?
I fixed my issue by configuring Capybara.
Capybara.configure do |config|
config.raise_server_errors = false
end