zhaoxjmail's repositories
Viterbi
An implementation of HMM-Viterbi Algorithm 通用的维特比算法实现
elasticsearch-concatenate-token-filter
Elasticsearch plugin which only provides a TokenFilter that merges tokens in a token stream back into one. Taken from http://elasticsearch-users.115913.n3.nabble.com/Is-there-a-concatenation-filter-td3711094.html
wechat-api
🗯 wechat-api by java7.
Ngram
Ngram model is used in a wide variety of applications, aimed to supply a multifunctional ngram tools to developers
ChineseSpellingCheck
中文拼写检查工具,用于对中文文本中的错误用语进行检测并给出纠正建议
THULAC-Java
An Efficient Lexical Analyzer for Chinese
SpellingCorrector-Java8
A port of Peter Norvig's Spelling Corrector to Java 8
Ice-demo
基于 Zeroc Ice 3.6.1 的Android、iOS、Java、Javascript例子,Ice 源码地址https://github.com/zeroc-ice/ice
textfilter
敏感词过滤的几种实现+某1w词敏感词库
LOF-java
LOF is an outlier detecting algorithm.
Web-Spider
A simple web crawler that crawls from a seed URL to a depth of 5. Implemented the crawler using both DFS and BFS algorithm.
spellcorrect
A program to correct non-word spelling error in sentences using ngram MAP Language Models, Noisy Channel Model, Error Confusion Matrix and Damerau-Levenshtein Edit Distance.
QRCodeLoginDemo
二维码登录和二维码生成及解析工具
Word2VEC_java
word2vec java版本的一个实现
berkeleylm
Automatically exported from code.google.com/p/berkeleylm
regexp-trie
Regexp::Trie for Java7
hao
好东西传送门
kylm
The Kyoyo Language Modeling Toolkit
chinesesegmentor
CRFs based Chinese word segmentor
DoubleArrayTrie
double array trie with unicode support
Java-readability
A port of the arclabs 'readability' package to Java