Tsangares / congress_military

Crawl CQPress for the duration in which congress people were in the military.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Summary

This is a crawler and parser for CQPress. They have parsed the biography data from https://bioguide.congress.gov/ to retrieve information about congress people.

Crawler

The file crawler.ipynb is notebook that contains a basic crawler using requests to retrieve html files.

Parser

The file parse_people.ipynb is a notebook where I combine filter and parse the people for critical characteristics.

About

Crawl CQPress for the duration in which congress people were in the military.


Languages

Language:HTML 87.0%Language:Jupyter Notebook 13.0%