nathanjordan / bernstein

Bernstein is a tool for journalists to gain insight into the rapidly expanding online news medium. Given a list of seed websites, the tool will crawl these sites and determine various metrics about the sites, including how they are connected.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Bernstein

Bernstein is a tool for journalists to gain insight into the rapidly expanding online news medium. Given a list of seed websites, the tool will crawl these sites and determine various metrics about the sites, including how they are connected.

Requirements

  • Python 2.7+
  • Neo4j Server 2.01+

Installation

git clone https://github.com/nathanjordan/bernstein.git
cd bernstein
pip install -r requirements.txt

Running

python bernstein/crawler.py

Bitdeli Badge

About

Bernstein is a tool for journalists to gain insight into the rapidly expanding online news medium. Given a list of seed websites, the tool will crawl these sites and determine various metrics about the sites, including how they are connected.


Languages

Language:Python 100.0%