mgitre / newsdiffs3

Logs changes of articles from major news outlets

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

newsdiffs3

A project based on ecprice's newsdiffs.

Setup

Requires access to a MongoDB server

Copy config_template.yaml to config.yaml.

Configure access to your database.

Running scraper.py scrapes articles from NYTimes, Washington Post, and APNews. I recommend attaching it to a cronjob.

server.py serves the web frontend, letting you view articles. Keep this running.

Usage

UI Navigating to http://[newsdiffs_location]/article/[article_url] will show any saved changes for that article. Using the menu on the right, you can select a single saved version to view or select two saved versions to compare.

Adding support for a news outlet

Adding a scraper can be done by copying the format from an existing one and replacing matches with ones that apply to your site. All matches are formatted for BeautifulSoup. Please feel free to contribute!

Built with

  • Python
    • Flask
    • BeautifulSoup
  • MongoDB

About

Logs changes of articles from major news outlets

License:GNU General Public License v3.0


Languages

Language:Python 51.4%Language:HTML 41.2%Language:CSS 7.4%