There are 20 repositories under the text-similarity topic.
Resume Matcher is an open source, free tool to improve your resume. It works by using language models to compare and rank resumes with job descriptions.
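text2vec, text to vector. A text vector representation tool that converts text into vector matrices; it implements text representation and text similarity models including Word2Vec, RankBM25, Sentence-BERT, and CoSENT, and works out of the box.
Search all Chinese NLP datasets, with commonly used English NLP datasets included.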
A curated list of papers dedicated to neural text (semantic) matching.
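Chinese text analysis toolkit (including text classification, text clustering, text similarity, keyword extraction, key phrase extraction, sentiment analysis, text error correction, text summarization, topic keywords, synonyms and near-synonyms, and event triple extraction).
Experiments with and a comparison of four sentence/text similarity computation methods.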
⚛️ A Keras-based implementation of a Siamese architecture using LSTM encoders to compute text similarity
Cybertron: the home planet of the Transformers in Go
:detective: Source code plagiarism detection
Implementation of Siamese neural networks built upon a multi-head attention mechanism for the text semantic similarity task.
A PyTorch-based toolkit for natural language processing
Mimix: A Text Generation Tool and Pretrained Chinese Models
Machine Learning APIs for common use cases, including: General OCR (Simplified/Traditional Chinese), Custom OCR, Image Similarity, Object Recognition, Face Detection, Face Comparison, Human Image Segmentation, Human Attribute Recognition, Pornography Detection, Image Super Resolution, Text Similarity, Car License Plate, etc.
Web app that automatically generates subjective or objective tests and evaluates user responses without human intervention, using machine learning and natural language processing.
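GTS Engine: A powerful NLU Training System. GTS Engine (GTS-Engine) is an out-of-the-box, high-performance natural language understanding engine focused on few-shot tasks; it can automatically produce NLP models from only a small number of samples.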
Expose a Top2Vec model with a REST API.
Arabic support for textblob
[ACM-CIKM] 2nd place solution at CIKM AnalytiCup 2018, a task for determining short text similarities.
Generate a custom, detailed survey paper with topic-clustered sections and proper citations from a single query in just under 30 minutes.
Text similarity, semantic vectors, text vectors, text-similarity, similarity, sentence-similarity, BERT, SimCSE, BERT-Whitening, Sentence-BERT, PromCSE, SBERT
A Python client for connecting to all the services provided by https://dandelion.eu
:microphone: Building a content-based podcast recommender system using NLP
Python Text Similarity NLP Library
Baseline for the CHIP 2019 Ping An Healthcare Technology disease question-answering transfer learning competition, rank 7
📖 Use Bi-Normal Separation to find document vectors, which are used to compute similarity for shorter sentences.
semantic-sh is a SimHash implementation that detects and groups similar texts by harnessing the power of word vectors and transformer-based language models (BERT).
Text similarity computation using MinHash and Jaccard distance on the Reuters dataset
JavaScript library for measuring the degree of phonetic similarity between texts
Short Text Similarity as described in https://dl.acm.org/citation.cfm?id=2806475
Simplified fine-tuning and deployment code based on BERT for text matching.