obins's repositories
serengeti-pantry
Cookbooks and roles used by Serengeti
hadoop-docker
Hadoop docker image
Font-Awesome
The iconic font and CSS toolkit
SparkOnHBase
SparkOnHBase
spark-getting-started
Source code from an affiliated blog post, the first part of which can be found at http://data-scientist-in-training.blogspot.com/2015/02/getting-started-with-spark-kitchen-sink.html.
datasci
datasci course stuffs
2014
Official content for the Fall 2014 Harvard CS109 Data Science course
stat-learning
Notes and exercise attempts for "An Introduction to Statistical Learning"
swirl_courses
A collection of interactive courses for the swirl R package.
ProgrammingAssignment2
Repository for Programming Assignment 2 for R Programming on Coursera
hadoop-tutorials-examples
Source, data and turotials of the blog post video series of Hue, the Web UI for Hadoop.
virtual-hadoop-cluster
A virtual Hadoop cluster running CDH5
allstate
Kaggle's Allstate Purchase Prediction Challenge
cdh5-vagrant
Ready-to-use, manually tuned Cloudera Hadoop Distribution 5 provisioned cluster
sklearn_pycon2014
Repository containing files for my PyCon 2014 scikit-learn tutorial.
ExData_Plotting1
Plotting Assignment 1 for Exploratory Data Analysis
twitbase-async
Example asynchbase application for HBase in Action
datasharing
The Leek group guide to data sharing
twitbase
TwitBase is a running example used throughout HBase In Action
twitbase.py
Example application demonstrating Thrift + Python for HBase in Action.
gis
GIS examples developed in Chapter 8 of HBase In Action
ml-examples-by-scalala
Machine Learning Algorithms Samples By Scalala