yinxusen's repositories
docker-predictionio
Run PredictionIO inside Docker
sparksql-perf
A performance benchmark for Spark SQL.
hadoop-common
Mirror of Apache Hadoop common
OpenDL
The Deep Learning training framework on Spark
BIDMat
A CPU and GPU-accelerated matrix library for data mining
caffe
Caffe
incubator-spark
Mirror of Apache Spark
scala
The Scala programming language
sparrow
Sparrow scheduling platform (U.C. Berkeley).
chalk-1
Chalk is a natural language processing library.
summingbird
Streaming MapReduce with Scalding and Storm
rose
ROSE is an Obscure Scheme Evaluator
vowpal_wabbit
John Langford's original release of Vowpal Wabbit -- a fast online learning algorithm
graphlab
A framework for large-scale machine learning and graph computation.
breeze
Breeze is a library for numerical processing, machine learning, and natural language processing. Its primary focus is on being generic, clean, and powerful without sacrificing (much) efficiency. Breeze is the merger of the ScalaNLP and Scalala projects, because one of the original maintainers is unable to continue development. The Scalala parts are largely rewritten.
svinet
This package implements algorithms for identifying overlapping communities in large undirected networks. The sampling-based algorithms derive from stochastic variational inference under the (assortative) mixed-membership stochastic blockmodel. For details, see the following reference: http://www.pnas.org/content/early/2013/08/14/1221839110.full.pdf
selfutils
Some small utilities for personal use.
numpy
NumPy main repository
scipy
SciPy main repository
HiBench
HiBench is a Hadoop benchmark suite.
nativetask
native task
pagerank
A PageRank implementation in C++ that can handle very large graphs
dpark
Python clone of Spark, a MapReduce-like framework
hadoop-twitter-pagerank
Hadoop example: a naive PageRank implementation for a Twitter dataset