HarvardOpenData / propublica-data

Python script that uses the ProPublica API to retrieve data (in CSV format) on any list of organizations.


Harvard Open Data Project: Student Club Finances

Propublica scripts

A Python script that uses the ProPublica API to retrieve data (in CSV format) on any list of US nonprofit organizations. To run it, type python propublica.py in your terminal. The data is written to "finaldata.csv".

Currently the script extracts data on final clubs and other student organizations at Harvard. To look up other organizations, find their ProPublica numbers (Nonprofit Explorer identifies each organization by its EIN) and edit the "listoforgs.csv" file accordingly.

Because of the speed of ProPublica's API, each organization takes around 2-5 seconds, so a full run takes a couple of minutes, depending on the number of organizations you input.

To find out more, see the documentation for ProPublica's Nonprofit Explorer API: https://projects.propublica.org/nonprofits/api
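The workflow described above (read organization numbers, query the API once per organization, write a CSV) might be sketched roughly as follows. This is not the code in propublica.py. It is a minimal illustration that assumes organizations are looked up by EIN through the v2 organizations endpoint, and that the response contains an "organization" object and a "filings_with_data" list with the field names used below. The function names (fetch_organization, filing_rows, build_csv) and the output columns are hypothetical.

```python
import csv
import json
import urllib.request

# Assumed endpoint: Nonprofit Explorer API v2, one organization per EIN.
API_URL = "https://projects.propublica.org/nonprofits/api/v2/organizations/{ein}.json"


def fetch_organization(ein):
    """Download the JSON record for one organization (makes a network call)."""
    with urllib.request.urlopen(API_URL.format(ein=ein), timeout=30) as response:
        return json.load(response)


def filing_rows(payload):
    """Flatten one API response into one row per filing.

    The keys read here (filings_with_data, tax_prd_yr, totrevenue,
    totfuncexpns) are assumptions about the response layout.
    """
    org = payload.get("organization", {})
    rows = []
    for filing in payload.get("filings_with_data", []):
        rows.append({
            "ein": org.get("ein"),
            "name": org.get("name"),
            "year": filing.get("tax_prd_yr"),
            "revenue": filing.get("totrevenue"),
            "expenses": filing.get("totfuncexpns"),
        })
    return rows


def build_csv(eins, out_file, fetch=fetch_organization):
    """Write one CSV row per filing for every EIN; return the row count.

    `fetch` is injectable so the function can be exercised without
    touching the network.
    """
    fields = ["ein", "name", "year", "revenue", "expenses"]
    writer = csv.DictWriter(out_file, fieldnames=fields)
    writer.writeheader()
    count = 0
    for ein in eins:
        for row in filing_rows(fetch(ein)):
            writer.writerow(row)
            count += 1
    return count


# Example usage (assumes listoforgs.csv holds one EIN in its first column):
# with open("listoforgs.csv", newline="") as src, \
#         open("finaldata.csv", "w", newline="") as dst:
#     eins = [row[0] for row in csv.reader(src) if row]
#     build_csv(eins, dst)
```

Because each organization needs its own request, the run time grows linearly with the length of the list, which matches the 2-5 seconds per organization noted above.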

Getting started

We use Jupyter (formerly known as IPython Notebook) for quick prototyping and testing.

To install, run pip install jupyter in your terminal. See the Jupyter installation documentation for more information.

Running

To open the Jupyter notebooks, run jupyter notebook in your terminal. A browser tab will pop open; click on a notebook to open it.

To run our final script, use python propublica.py.

About

This is a project of the Harvard Open Data Project, a student-faculty collaboration dedicated to opening and analyzing Harvard data to empower our community members to improve campus life.


License: MIT License


Languages

Python 75.4%, Jupyter Notebook 24.6%