PengYan's repositories

Familia

A Toolkit for Chinese Topic Modeling

Language: C++, License: BSD-3-Clause, Stargazers: 0, Issues: 0

LightGBM

A fast, distributed, high-performance gradient boosting (GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks. It is part of Microsoft's DMTK project (http://github.com/microsoft/dmtk).

Language: C++, License: MIT, Stargazers: 0, Issues: 0

test

test github

Stargazers: 0, Issues: 0

ansj_seg

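Ansj word segmentation: a true Java implementation of ICT. Its segmentation quality and speed both surpass the open-source version of ICT. Features: Chinese word segmentation, person-name recognition, part-of-speech tagging, and user-defined dictionaries.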

Language: Java, License: Apache-2.0, Stargazers: 0, Issues: 0