invernizzi / js-crawler

A short and simple python crawler, that uses Webkit and executes Javascript

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

js-crawler

A short and simple web crawler written in Python, that uses Webkit and executes Javascript.

How to use

crawler = Crawler(gui=True,                                                 # To see the crawler in action
                  is_link_interesting=lambda url, text: 'download' in url)  # Follow every link containing
                                                                            #  "download" in the url
crawler.crawl('http://firefox.com')
crawler.close()

About

A short and simple python crawler, that uses Webkit and executes Javascript


Languages

Language:Python 100.0%