xiangyu's repositories
LDA-Compiled
LDA code commpiled from internet
alloyeditor.com
Website of Alloy Editor
Chinese-Whispers
Chris Biemann Version 1.0, May 2006 http://wortschatz.informatik.uni-leipzig.de/~cbiemann/software/CW.html
flume-headers-to-avro-serializer
Serializer for build Avro file using Flume event headers
flume-ng-sql-source
Flume Source to import data from SQL Databases
LatticeWordSegmentation
Software to apply unsupervised word segmentation on lattices or text sequences using a nested hierarchical Pitman Yor language model
LightGBM
A fast, distributed, high performance gradient boosting (GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks. It is under the umbrella of the DMTK(http://github.com/microsoft/dmtk) project of Microsoft.
nhpylm
Python bindings for a c++ based implementation of the Nested Hierarchical Pitman-Yor Language model
nl-practice
Repository for Practice
python-npycrf
条件付確率場とベイズ階層言語モデルの統合による半教師あり形態素解析
sgwater-dpseg
Unsupervised word segmentation using dirichlet process, imported from homepages.inf.ed.ac.uk/sgwater/resources.html
THULAC-Java
An Efficient Lexical Analyzer for Chinese
THULAC-Python
An Efficient Lexical Analyzer for Chinese
vpylm
VPYLMのC++実装