eklem / browsercrawler

Crawling content from a site within the browser. A basis for i.e. a search solution for static sites.

https://eklem.github.io/browsercrawler/doc/

Obey robots.txt

eklem opened this issue 7 years ago · comments

Espen Klem commented 7 years ago

Get robots.txt, and check before each URL is crawled.

Espen Klem commented 6 years ago

Will be own server. Could be nice to have down the road, but not now.