eklem / browsercrawler

Crawling content from a site within the browser. A basis for e.g. a search solution for static sites.

Home Page: https://eklem.github.io/browsercrawler/doc/

Obey robots.txt

eklem opened this issue

Fetch robots.txt, and check each URL against it before that URL is crawled.
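A minimal sketch of what such a check could look like, not the project's actual implementation. The names `parseRobots`, `isAllowed` and `loadRules` are hypothetical. It assumes plain path-prefix rules only (no `*` or `$` wildcards), picks the longest matching rule, and treats a missing or unreadable robots.txt as "everything allowed".

```javascript
// Hypothetical helpers; none of these names come from browsercrawler itself.

// Collect Allow/Disallow rules from the groups that apply to `agent`.
function parseRobots(text, agent = "*") {
  const rules = [];
  let groupAgents = [];
  let inRules = false; // true once the current group has started listing rules
  for (const rawLine of text.split(/\r?\n/)) {
    const line = rawLine.replace(/#.*$/, "").trim(); // strip comments
    const sep = line.indexOf(":");
    if (sep === -1) continue; // blank line or not a "field: value" line
    const field = line.slice(0, sep).trim().toLowerCase();
    const value = line.slice(sep + 1).trim();
    if (field === "user-agent") {
      if (inRules) {
        // A User-agent line after rules starts a new group.
        groupAgents = [];
        inRules = false;
      }
      groupAgents.push(value.toLowerCase());
    } else if (field === "allow" || field === "disallow") {
      inRules = true;
      // An empty value (e.g. "Disallow:") restricts nothing, so skip it.
      if (groupAgents.includes(agent.toLowerCase()) && value !== "") {
        rules.push({ allow: field === "allow", path: value });
      }
    }
  }
  return rules;
}

// Longest matching prefix wins; on a tie, Allow wins. No match means allowed.
function isAllowed(rules, path) {
  let best = null;
  for (const rule of rules) {
    if (!path.startsWith(rule.path)) continue;
    if (
      best === null ||
      rule.path.length > best.path.length ||
      (rule.path.length === best.path.length && rule.allow)
    ) {
      best = rule;
    }
  }
  return best === null ? true : best.allow;
}

// Fetch robots.txt once per origin; assumption: any non-OK response means no rules.
async function loadRules(origin) {
  const response = await fetch(new URL("/robots.txt", origin));
  return response.ok ? parseRobots(await response.text()) : [];
}
```

The crawler would call `loadRules(location.origin)` once, then call `isAllowed(rules, new URL(url).pathname)` before each request and skip any URL for which it returns `false`.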

The crawled site will be your own server, so this could be nice to have down the road, but not now.