crachmanin / cs6963Project

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

cs6963Project

Currently we have a pagerank example for both hadoop and spark. Most of the pagerank code for hadoop was borrowed from https://github.com/danielepantaleone/hadoop-pagerank The spark pagerank is part of the spark examples that come preinstalled.

The spark logistic regression example is based off the psuedocode in the spark paper.

We also have scripts for installing hadoop and Spark, as well as one for clearing page caches. The excel file containing execution times for logistic regression is included as well

About


Languages

Language:Shell 40.3%Language:XSLT 21.8%Language:Java 17.5%Language:Scala 15.3%Language:CSS 4.5%Language:Python 0.6%