There are 22 repositories under text-similarity topic.
Resume Matcher is an open source, free tool to improve your resume. It works by using AI, Reader LLMs, to compare and rank resumes with job descriptions.
text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。
搜索所有中文NLP数据集,附常用英文NLP数据集
A curated list of papers dedicated to neural text (semantic) matching.
中文文本分析工具包(包括- 文本分类 - 文本聚类 - 文本相似性 - 关键词抽取 - 关键短语抽取 - 情感分析 - 文本纠错 - 文本摘要 - 主题关键词-同义词、近义词-事件三元组抽取)
Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard
作业查重软件,它实现了程序代码、文档文本、图片之间的相似度检查。a code-similarity, text-similarity and image-similarity computation software for the codes, documents and images of assignment.
Cybertron: the home planet of the Transformers in Go
对四种句子/文本相似度计算方法进行实验与比较
⚛️ It is keras based implementation of siamese architecture using lstm encoders to compute text similarity
:detective: Source code plagiarism detection
Implementation of Siamese Neural Networks built upon multihead attention mechanism for text semantic similarity task.
Machine Learning APIs for common use cases, include: General OCR (Simplified/Traditional Chinese), Custom OCR, Image Similarity, Object Recognition, Face Detection, Face Comparison, Human Image Segmentation, Human Attribute Recognition, Pornography Detection, Image Super Resolution, Text Similarity, Car License Plate, etc.
Mimix: A Text Generation Tool and Pretrained Chinese Models
A PyTorch-based toolkit for natural language processing
Web app to automatically generate subjective or an objective test and evaluate user responses without any human intervention in an efficient and automatic manner using machine learning and natural language processing.
GTS Engine: A powerful NLU Training System。GTS引擎(GTS-Engine)是一款开箱即用且性能强大的自然语言理解引擎,聚焦于小样本任务,能够仅用小样本就能自动化生产NLP模型。
Expose a Top2Vec model with a REST API.
Arabic support for textblob
[ACM-CIKM] 2nd place solution at CIKM AnalytiCup 2018, a task for determining short text similarities.
文本相似度,语义向量,文本向量,text-similarity,similarity, sentence-similarity,BERT,SimCSE,BERT-Whitening,Sentence-BERT, PromCSE, SBERT
Generate custom detailed survey paper with topic clustered sections and proper citations, from just a single query in just under 30 mins !!
Text preprocessing, representation, similarity calculation, text search and classification. Let's go and play with text!
A python client for connecting to all the services provided by https://dandelion.eu
Python Text Similarity NLP Libray
:microphone: Building a content-based podcast recommender system using NLP
📖 Use Bi-normal Separation to find document vectors which is used to compute similarity for shorter sentences.
semantic-sh is a SimHash implementation to detect and group similar texts by taking power of word vectors and transformer-based language models (BERT).
CHIP 2019平安医疗科技疾病问答迁移学习比赛baseline,rank7
Two-part information retrieval system: 1) Pre-process text files, generate TF-IDF matrix and inverted index. 2) Retrieve relevant documents ranked by cosine similarity for given queries.
Simple in-memory vector database for text similarity in Node.js
JavaScript library useful to find degrees of similarity between text's phonetics
Short Text Similarity as described in https://dl.acm.org/citation.cfm?id=2806475