Baotong Zhuang's starred repositories
stream-lib
Stream summarizer and cardinality estimator.
MyMediaLite
Recommender system library for the CLR (.NET)
unsupervised-language-identification
An unsupervised language identification algorithm in Ruby, built originally for detecting English-language tweets.
gradient-svd
A simple SVD + LSI implementation in Ruby, based on gradient descent. Useful if you have a *small* matrix with missing values.
prediction-strength
An implementation of the prediction strength algorithm from Tibshirani, Walther, Botstein, and Brown's "Cluster validation by prediction strength".
gap-statistic
An implementation of the gap statistic algorithm to compute the number of clusters in a set of numerical data.
archived_madlib
MADlib has moved to Apache MADlib (incubating). Please send pull requests to the Apache repository.
scikit-learn
scikit-learn: machine learning in Python
elephant-bird
Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.
Play-Econometrics-with-R
A brochure about "Play Econometrics with R"
hadoop-lzo
Refactored version of code.google.com/hadoop-gpl-compression for Hadoop 0.20
vowpal_wabbit
Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive learning.
emacs-starter-kit
[ARCHIVED] this is ancient history