Bernstein
Bernstein is a tool that helps journalists gain insight into the rapidly expanding online news medium. Given a list of seed websites, it crawls those sites and computes metrics about them, including how the sites are connected to one another.
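To make the idea of "how sites are connected" concrete, here is a minimal sketch of one way to derive a site-to-site link graph from crawled pages. It is not Bernstein's actual code: the names `LinkExtractor`, `outbound_domains`, and `build_link_graph` are hypothetical, the pages are passed in as already-fetched HTML rather than downloaded, and the sketch uses only the Python 3 standard library.

```python
from collections import defaultdict
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class LinkExtractor(HTMLParser):
    """Collects the href of every <a> tag in a page (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def outbound_domains(page_url, html):
    """Return the set of other domains that a page links to."""
    parser = LinkExtractor()
    parser.feed(html)
    source = urlparse(page_url).netloc
    targets = set()
    for href in parser.hrefs:
        # Resolve relative links against the page URL before comparing hosts.
        netloc = urlparse(urljoin(page_url, href)).netloc
        # Skip links without a host (e.g. mailto:) and links within the same site.
        if netloc and netloc != source:
            targets.add(netloc)
    return targets


def build_link_graph(pages):
    """Map each source domain to the set of domains it links to.

    `pages` is a dict of {page_url: html}. This is an assumption made for
    illustration; a real crawler would fetch the pages itself.
    """
    graph = defaultdict(set)
    for page_url, html in pages.items():
        graph[urlparse(page_url).netloc] |= outbound_domains(page_url, html)
    return dict(graph)
```

Calling `build_link_graph` with a dict of page URLs and their HTML returns one entry per source domain. In a setup like the one described here, those edges could then be stored in a graph database such as Neo4j, though how Bernstein itself stores them is not specified in this document.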
Requirements
- Python 2.7+
- Neo4j Server 2.0.1+
Installation
git clone https://github.com/nathanjordan/bernstein.git
cd bernstein
pip install -r requirements.txt
Running
python bernstein/crawler.py