This PHP library allow you to crawl a website and return all links. It supports resuming and you can also index data from HTML to a solr core (see configuration).
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool