sunmanli's repositories
automated-feature-engineering
Automated feature engineering in Python with Featuretools
jieba
结巴中文分词
spark-scala-tutorial
A free tutorial for Apache Spark.
learning-nlp
nlp in action
xgboost
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Flink and DataFlow
spark
Mirror of Apache Spark
spark-doc-zh
Apache Spark 官方文档中文版
xgboost-doc-zh
XGBoost 中文文档
THUCKE
THU Chinese Keyphrase Extraction Toolkit
gobook
The Go Programming Language
mlpack
mlpack: a scalable C++ machine learning library
dlib
A toolkit for making real world machine learning and data analysis applications in C++
THULAC-Python
An Efficient Lexical Analyzer for Chinese
scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
caffe
Caffe on both Linux and Windows
clstm
A small C++ implementation of LSTM networks, focused on OCR.
list
链表
compare
compare embedding
MachineLearning
Basic MachineLearning algorithm
word2vec
This tool provides an efficient implementation of the continuous bag-of-words and skip-gram architectures for computing vector representations of words. These representations can be subsequently used in many natural language processing applications and for further research.