Will's repositories
-
收集整理 GitHub 上高质量、有趣的开源项目。
ansj_seg
ansj分词.ict的真正java实现.分词效果速度都超过开源版的ict. 中文分词,人名识别,词性标注,用户自定义词典
char-rnn
Multi-layer Recurrent Neural Networks (LSTM, GRU, RNN) for character-level language models in Torch
ChatBotCourse
自己动手做聊天机器人教程
Chinese
Tools and resources for Chinese texts preprocessing. Validated in two papers, one CCF C, EI indexing and one CCF B, SCI indexing.
cws_evaluation
Java开源项目cws_evaluation:中文分词器分词效果评估对比
elasticsearch-analysis-ik
The IK Analysis plugin integrates Lucene IK analyzer into elasticsearch, support customized dictionary.
fastText
Library for fast text representation and classification.
HanLP
自然语言处理 中文分词 词性标注 命名实体识别 依存句法分析 关键词提取 自动摘要 短语提取 拼音 简繁转换
interviews
Everything you need to know to get the job.
jieba
结巴中文分词
jieba-analysis
结巴分词(java版)
MachineLearning_Python
机器学习算法python实现
MLAlgorithms
Minimal and clean examples of machine learning algorithms
mmseg4j-core
mmseg4j core MMSEG for java chinese analyzer
mycroft-precise
A lightweight, simple-to-use, RNN wake word listener
neural-storyteller
A recurrent neural network for generating little stories about images
pycorrector
pycorrector is a toolkit for text error correction. 文本纠错,Kenlm,ConvSeq2Seq,BERT,MacBERT,ELECTRA,ERNIE,Transformer,T5等模型实现,开箱即用。
react-native-markdown-display
React Native 100% compatible CommonMark renderer
react-native-webview
React Native Cross-Platform WebView
react-native-youtube-iframe
A wrapper of the Youtube-iframe API built for react native.
Synonyms
中文近义词工具包
tf-idf-keyword
基于特定语料库的TD-IDF的中文关键词提取
THULAC-Java
An Efficient Lexical Analyzer for Chinese
tvs-tools
文档及开发评测脚本
weixin-java-tools
全能微信Java开发工具包,支持包括微信支付、开放平台、小程序、企业微信/企业号和公众号等的开发
word
Java分布式中文分词组件 - word分词