Clayton Kim's starred repositories
first-stories-twitter
How to spot first stories on Twitter using Storm.
scalding-knn
k-nearest-neighbors in Scalding.
scalding-nb
Naive Bayes classifier written in Scalding
superconductor
Big data visualization on the web
naive-bayes-classifier-scala
A Naive Bayes Classifier in Scala
Locality-Sensitive-Hashing
A Scala library for locality sensitive hashing
scikit-sos
A Python implementation of the Stochastic Outlier Selection algorithm
data-science-at-the-command-line
Data Science at the Command Line
BuildingMachineLearningSystemsWithPython
Source code for the book Building Machine Learning Systems with Python
LearnDataScience
Open Content for self-directed learning in data science
Python-Numerics
Numerical machines in Python
Probabilistic-Programming-and-Bayesian-Methods-for-Hackers
aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)