gregce / W251-TwitterLanguage

Twitter Spritzer Archive Analysis for MIDS W251

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Twitter Spritzer Language Exploration

For MIDS W251, Summer 2016

The folder structure for this repository is as follows:

  • Configuration: Details instructions used to set up our 9 Ubuntu nodes
  • Storage: Provides loading details + DDL for our cassandra key stores
  • Exploration: Source code for data extraction and exploration in python and R
  • Visualization: Final datafile + .Rmd file used to visualize our results
  • Project Assets: Hard copies (pdfs) of our final project presentation, whitepaper & screenshots of our running cluster

Tweet Language Explorer

Accessible here http://192.155.215.11:3838/w251/

Public version of this source code

This repo is a public version of the source code for the Twitter Lang app. The final app is built with R, JavaScript and Shiny and is based on data initially loaded and sourced from a 9 node cassandra cluster. If you got this far, we have to assume you can look after yourself with R and navigating this GitHub repo; we can’t give detailed guidance or help.

If you do want to launch the final version of the viz, the massaged and cleaned data ultimatley used can be downloaded directly here as an .Rda file.

Long after this course has ended and our machines used to conduct this analysis are reclaimed, the app will persist here on shinyapps.io

About

Twitter Spritzer Archive Analysis for MIDS W251


Languages

Language:Jupyter Notebook 59.3%Language:R 28.0%Language:Python 10.3%Language:Shell 2.4%