Lipairui's repositories
Classification
Classification model including XGBoost, GBDT, RandomForest, lightGBM, stacking model...
Text-similarity-WMD_enhanced
Compute text similarity using Word Mover's Distance algorithm (Enhanced)
CSV-to-Neo4j
Convert csv data to Neo4j Graph Database.
Text-similarity-WMDsimilarity
Compute text similarity by using gensim wmdsimilarity
Semantic-Travel-Distance
A novel automatic evaluation metric for Machine Translation based on word embeddings.
Text-semantic-similarity
Calculate semantic similarity of two texts. Models include word2vec, tfidf, lda, lsi.
Text-similarity-centroid-of-the-word-vectors
Compute text similarity by calculating the cosine similarity of document vectors (Centroid of word vectors)
Animated-bar-plot
Generate animated bar plot in GIF format.
ChatBotCourse
自己动手做聊天机器人教程
Clean-text
Text preprocess, remove useless content including html, url...
Deal_with_Imbalance
Deal with imbalanced dataset. Utilizing over sampling, down sampling and combined sampling.
Feedback_filter
Filter noise feedback, text classification combining LSTM, TFIDF stacking, XGBoost, word2vec, LDA, LSI...
flask
The Python micro framework for building web applications.
Glove2word2vec
Transform glove format to word2vec format
MT_evaluation
Automatic evaluate Machine Translation(MT) results based on Google's Universal Sentence Encoder.
New_words_find
Automatically extract words of a corpus.
QASystemOnMedicalKG
A tutorial and implement of disease centered Medical knowledge graph and qa system based on it。知识图谱构建,自动问答,基于kg的自动问答。以疾病为中心的一定规模医药领域知识图谱,并以该知识图谱完成自动问答与分析服务。
Text_classification
Useful model API for text classification including tfidf, lda, lsi, word2vec, lstm...