ebensing / gov2430lab1

Government 2430, Lab 1 (Fall 2013), Analyzing Enron Data

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Lab 1 - Enron Data Analysis

This is repo for the first lab in Gov 2430 (Fall 2013).

Setup

  1. Download and unpack the "Metadata by Users - CSV files, one for each User" from here
  2. The .csv files need to live in the same directory as process.py

Usage

./process.py - spits out the top 30 most common words and create a file sortedCounts.txt that contains a full, sorted list of all words

./process.py FILENAME - this will read in FILENAME (which contains words, one per line) and spit out the top 10 users for each word

About

Government 2430, Lab 1 (Fall 2013), Analyzing Enron Data


Languages

Language:Python 100.0%