Xiangqian Lee's repositories
K-meansCluster
A java implementation of k-means algorithm.It uses ball tree as internal data structure to accelerate the computation.It uses 2-norm distance to compute the similarity between instances.
LilyPinyinInputMethod
A java implemented Pinyin Input method.It is a team project for my natural language processing course which I am the leader.It now uses a pinyin table downloaded at(http://www.datatang.com/data/11858) and a reduced corpus from Sogou Corp. which you can download at (http://www.sogou.com/labs/dl/c.html) .Have fun!
PYSeparator
Separate a String to syllybles(single legal Pinyin) most likely to possible intension of users.You should provide the function of a dictionary by implementing the dict interface.
ChineseWordSegmentation
Segment Chinese sentences into separated words.
DrawingLottery
This is necessary for you when a year ends and the next starts.
HiddenMarkovModel
A simple implentation of HMM.
Naive-Bayes-Classifier
A simple implemantation by Octave of naive Bayes Classifier,with a hypothesis that the continuous values associated with each class are distributed according to a Gaussian distribution.
skyline0623.github.io
My blog built by hexo.
SpellingCorrector
Based on the blog "How to Write a Spelling Corrector" by Peter Norvig,I implement a Java version of it.Further work will be focused on improving the error model.This implementation has a assumption that the word with an edit distance of 1 to the oringinal is more likely than the one with distance of 2.That may not be true for many cases.