DouglasKrouth / PDAP-Scrapers

Code relating to scraping public police data.

Home Page: https://pdap.io

Police Data Accessibility Project Scrapers

This repo contains the data scrapers for the Police Data Accessibility Project. Thank you for being here!

Scrapers are one piece of the puzzle of making police data more accessible. For a broader look at the work we're doing, and where our attention is focused, start with the docs: https://docs.pdap.io/ and the roadmap: https://github.com/orgs/Police-Data-Accessibility-Project/projects/17

How to run a scraper

Right now, this requires some Python knowledge and patience. We're in the early stages: there's no automated scraper farm or fancy GUI yet.

  1. Install Python.
  2. Clone this repo.
  3. Find the scraper you wish to run. These are sorted geographically, so start by looking in /USA/....
  4. Run the scraper.py file with something like python3 <scraper path>. The exact command (python3, python, or py) depends on how you installed Python.
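
The steps above can also be scripted. The following is a minimal sketch, not part of this repo: it assumes scrapers live under a USA folder in your local clone and are named scraper.py, as described in the steps. The helper names find_scrapers and run_scraper are hypothetical.

```python
import subprocess
import sys
from pathlib import Path


def find_scrapers(repo_root, region="USA"):
    """Return sorted paths of every scraper.py under <repo_root>/<region>.

    Assumption: scrapers are named scraper.py and sorted geographically
    beneath the region folder.
    """
    return sorted(Path(repo_root, region).rglob("scraper.py"))


def run_scraper(scraper_path):
    """Run one scraper with the current Python interpreter.

    Returns the scraper's exit code (0 usually means it finished cleanly).
    """
    result = subprocess.run([sys.executable, str(scraper_path)])
    return result.returncode


if __name__ == "__main__":
    # Usage: python3 this_script.py <path to your clone of the repo>
    root = sys.argv[1] if len(sys.argv) > 1 else "."
    for path in find_scrapers(root):
        print(path)
```

Run as a script, this only lists the scrapers it finds. Call run_scraper on a single path from that list when you are ready to run one; some scrapers may need extra dependencies or setup that this sketch does not handle.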

Did it work?

If it worked, discuss your findings in our Discord. If it didn't, make an issue in this repo or reach out in Discord.

How to contribute

To write a scraper, start with CONTRIBUTING.md. Be sure to check out the /common folder!

For everything else, start with docs.pdap.io.

What data are we scraping?

The data sources listed here are our to-do list. If we should be targeting a new data type, suggest it in Discord or make a DoltHub PR!

Resources

Potentially useful tools. If you find something useful, or if one of these is out of date, make a PR!

About

License: GNU General Public License v3.0


Languages

Python 97.2%, JavaScript 2.2%, Dockerfile 0.5%, Makefile 0.1%