BruceXu's repositories
leaves
Pure Go implementation of the prediction part for GBRT (Gradient Boosting Regression Trees) models from popular frameworks
machine_learning_beginner
Works from the "Machine Learning Beginner" (机器学习初学者) WeChat official account
pkuseg-python
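Python version: a domain-specific Chinese word segmentation toolkit that is simple and easy to use, with higher segmentation accuracy than existing open-source tools.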
analytics-zoo
Distributed TensorFlow, Keras and BigDL on Apache Spark
homemade-machine-learning
🤖 Python examples of popular machine learning algorithms with interactive Jupyter demos and math being explained
algorithms
Bug-tracking for Jeff's algorithms book, notes, etc.
lihang-code
Code implementations for the book "Statistical Learning Methods" (《统计学习方法》)
Learning-from-data
Solutions to the exercises in the book Learning from Data
spark-doc-zh
Chinese edition of the official Apache Spark documentation
Algorithm_Interview_Notes-Chinese
2018/2019/campus recruitment/spring recruitment/autumn recruitment/algorithms/Machine Learning/Deep Learning/Natural Language Processing (NLP)/C/C++/Python/interview notes
flink-china-doc
Chinese translation project for the official Flink documentation :cn:
awesome-algorithm
LeetCode solutions (working out the code step by step from the reasoning) and implementations of classic algorithms
learning-tf-zh
:book: [Translation] TensorFlow learning guide
gensim
Topic Modelling for Humans
SparkGBM
Spark-based GBM
fastText
Library for fast text representation and classification.
xgboost
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Flink and DataFlow
MUSE
A library for Multilingual Unsupervised or Supervised word Embeddings
StarSpace
Learning embeddings for classification, retrieval and ranking.
MovieTaster-Open
Movie recommendation using Item2Vec
THULAC-Python
An Efficient Lexical Analyzer for Chinese
xgboost-go
Go wrapper for the XGBoost c_api
LearningSpark
Scala examples for learning to use Spark
kafka-doc-zh
Kafka documentation in Chinese
TensorFlow-Examples
TensorFlow Tutorial and Examples for Beginners with Latest APIs
storm-doc-zh
Chinese edition of the official Apache Storm documentation
HanLP
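Natural language processing: Chinese word segmentation, part-of-speech tagging, named entity recognition, dependency parsing, keyword extraction, automatic summarization, phrase extraction, pinyin conversion, Simplified/Traditional Chinese conversion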
Chinese
Tools and resources for preprocessing Chinese texts. Validated in two papers: one CCF C, EI-indexed, and one CCF B, SCI-indexed.