alex's repositories
glint
Glint: High-performance Scala parameter server
ps-lite
A lightweight parameter server interface
NewsRec
rank
sparkling-water
Sparkling Water provides H2O functionality inside a Spark cluster
lightlda
Scalable, fast, and lightweight system for large-scale topic modeling
trace-analysis
Scripts to analyze Spark's performance
CTR-estimator
LR and FM models (with SGD or FTRL)
flow
Adds static typing to JavaScript to improve developer productivity and code quality.
vowpal_wabbit
Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive learning.
DistML
DistML provides a supplement to MLlib to support model parallelism on Spark
factorie
FACTORIE is a toolkit for deployable probabilistic modeling, implemented as a software library in Scala. It provides its users with a succinct language for creating relational factor graphs, estimating parameters and performing inference.
spark-MDLP-discretization
Spark implementation of Fayyad's discretizer based on Minimum Description Length Principle (MDLP)
graphchi
Automatically exported from code.google.com/p/graphchi
breeze
Breeze is a numerical processing library for Scala.
myrrix-recommender
Automatically exported from code.google.com/p/myrrix-recommender
scalalab
ScalaLab: Efficient MATLAB-like scientific computing for the Java platform
zen
Zen aims to provide the largest-scale and most efficient machine learning platform on top of Spark, including but not limited to logistic regression, latent Dirichlet allocation, factorization machines and DNN.
simhash
Simhash value computation for Chinese documents
spark-FM-parallelSGD
Implementation of Factorization Machines on Spark using parallel stochastic gradient descent (Python and Scala)
spark
Mirror of Apache Spark
SparkInternals
Notes talking about the design and implementation of Apache Spark
xgboost
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, C++ and more
libfm
Library for factorization machines
DeepLearningTutorials
Deep Learning Tutorial notes and code. See the wiki for more info.
modelmatrix
Sparse feature extraction with Spark
fb.resnet.torch
Torch implementation of ResNet from http://arxiv.org/abs/1512.03385 and training scripts
streamDM
Stream Data Mining Library for Spark Streaming
Kaggler
Code for Kaggle Data Science Competitions
spark-libFM
An implementation of Factorization Machines (LibFM)