palewire / sopr-contribs

Scripts for processing and analyzing federal lobbyist disclosure data reporting contributions to political campaigns

Home Page:http://www.palewire.com

A script that fetches, parses and archives the XML data dumps of lobbyists'
political contributions published by the Senate Office of Public Records.
 
Zip files containing the XML are:
1. Downloaded and unzipped.
2. Parsed out into flat text files and stored in a timestamped folder structure.
3. Imported to a SQLite database.
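
The three steps above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: the element name Contribution, the field names in FIELDS, the contributions table, and the output file name are all assumptions made for the example, and the real SOPR XML schema may differ. The download URL is deliberately left as a parameter because it is not given here.

```python
import csv
import io
import sqlite3
import urllib.request
import xml.etree.ElementTree as ET
import zipfile
from datetime import datetime
from pathlib import Path

# Hypothetical attribute names; the real SOPR XML schema may differ.
FIELDS = ["filer", "honoree", "amount", "date"]


def download(url):
    """Step 1a: fetch a zip archive and return its raw bytes."""
    with urllib.request.urlopen(url) as response:
        return response.read()


def extract_xml(zip_bytes):
    """Step 1b: unzip in memory and yield each XML document as bytes."""
    with zipfile.ZipFile(io.BytesIO(zip_bytes)) as archive:
        for name in archive.namelist():
            if name.lower().endswith(".xml"):
                yield archive.read(name)


def parse_rows(xml_bytes):
    """Step 2a: flatten each <Contribution> element (assumed name) into a row."""
    root = ET.fromstring(xml_bytes)
    for node in root.iter("Contribution"):
        yield [node.get(field, "") for field in FIELDS]


def write_flat_file(rows, base_dir):
    """Step 2b: write rows to a tab-delimited file in a timestamped folder."""
    folder = Path(base_dir) / datetime.now().strftime("%Y-%m-%d_%H%M%S")
    folder.mkdir(parents=True, exist_ok=True)
    path = folder / "contributions.txt"
    with open(path, "w", newline="", encoding="utf-8") as handle:
        csv.writer(handle, delimiter="\t").writerows(rows)
    return path


def load_sqlite(flat_path, db_path):
    """Step 3: import the flat file into a SQLite table."""
    conn = sqlite3.connect(db_path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS contributions "
        "(filer TEXT, honoree TEXT, amount REAL, date TEXT)"
    )
    with open(flat_path, newline="", encoding="utf-8") as handle:
        conn.executemany(
            "INSERT INTO contributions VALUES (?, ?, ?, ?)",
            csv.reader(handle, delimiter="\t"),
        )
    conn.commit()
    return conn
```

Under these assumptions, a full run would pass the bytes returned by download() to extract_xml(), feed each document through parse_rows(), and hand the resulting rows to write_flat_file() and then load_sqlite(). Keeping the flat files in timestamped folders means each run leaves an archive that can be re-imported without fetching the data again.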
 
The ultimate goal is for a series of SQL statements to scrub and cut the data
to account for flaws in the reporting system first uncovered by Bill Allison
and Anupama Narayanswamy of The Sunlight Foundation.

Languages

Language:Python 100.0%