Eclipseeeee's starred repositories
HarvestText
文本挖掘和预处理工具(文本清洗、新词发现、情感分析、实体识别链接、关键词抽取、知识抽取、句法分析等),无监督或弱监督方法
TX-WORD2VEC-SMALL
腾讯word2vec模型缩小版
cosine_similarity_tfidf_nltk
calculate tfidf and cosine similarity using nltk
NLP-Model-for-Corpus-Similarity
A NLP algorithm I developed to determine the similarity or relation between two documents/Wikipedia articles. Inspired by the cosine similarity algorithm and built from WordNet.
tf-idf-python
Term frequency–inverse document frequency for Chinese novel/documents implemented in python.