A text analysis of Federal websites
See requirements.txt for the libraries this uses.
For nltk, you'll need to download the stopwords copora. Open a Python console and do the following:
import nltk nltk.download("stopwords")
scrapy crawl tutorialspider
scrapy crawl tutorialspider -o items.json