Vincent's repositories
CoolplaySpark
酷玩 Spark: Spark 源代码解析、Spark 类库、Spark 代码等
data-science-ipython-notebooks
Continually updated data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe), scikit-learn, Kaggle, Spark, Hadoop MapReduce, HDFS, matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines. https://bit.ly/data-notes
canal
阿里巴巴mysql数据库binlog的增量订阅&消费组件
kylin-mondrian-interaction
Some information about Apache Kylin interaction with Pentaho Mondrian
CAPTCHA-breaking
DataCastle验证码识别大赛
phoenix
Mirror of Apache Phoenix
awesome-machine-learning
A curated list of awesome Machine Learning frameworks, libraries and software.
MachineLearning
Literature Study
FeatureFu
Library and tools for advanced feature engineering
StockInference-Spark
Stock inference engine using Spring XD, Apache Geode / GemFire and Spark ML Lib.
mptun
Multi-path Tunnel
bi-platform
提供报表和OLAP服务的敏捷BI平台
keras
Theano-based Deep Learning library (convnets, recurrent neural networks, and more).
HanLP
汉语言处理包 中文分词 词性标注 命名实体识别 依存句法分析 关键词提取 自动摘要 短语提取 拼音 简繁 Lucene
BatchImageProcessor
A Mass Image Processing tool for Windows
spaCy
Industrial strength NLP with Python and Cython
d3
A JavaScript visualization library for HTML and SVG.
Big-Data-Resources
:boom:大数据/数据挖掘/推荐系统/机器学习相关资源
deeplearning-class-2011
Code for Deep Learning class at Google