khandelwal / fedtext

A text analysis of Federal websites

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

fedtext

A text analysis of Federal websites

Installation

See requirements.txt for the libraries this uses.

For nltk, you'll need to download the stopwords copora. Open a Python console and do the following:

import nltk nltk.download("stopwords")

Sample Run Commands

  • scrapy crawl tutorialspider
  • scrapy crawl tutorialspider -o items.json

About

A text analysis of Federal websites

License:Creative Commons Zero v1.0 Universal


Languages

Language:Python 100.0%