Jerry-zhxf / EnronAnalysis

This is a compilation of data visualization and machine learning algorithms applied on the FERC Enron Dataset.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

EnronAnalysis

This is a compilation of data visualization and machine learning algorithms applied on the FERC Enron Dataset. The R programming section deals with data cleaning and visualization techniques I've learned from RPubs and Analytics Edge; credit to them. Comments in the files will help you discern the uses of various commands and functions.

Using python, machine learning algorithms were applied to test a predictor model in identifying a Person of Interest. It currently works only on identfying if the emails were authored by Chris or Sara. Further work is required for a full fledged predictor model.

About

This is a compilation of data visualization and machine learning algorithms applied on the FERC Enron Dataset.

License:GNU Affero General Public License v3.0


Languages

Language:R 74.1%Language:Python 25.9%