There are 9 repositories under paraphrase-identification topic.
BiMPM: Bilateral Multi-Perspective Matching for Natural Language Sentences
Various models and code (Manhattan LSTM, Siamese LSTM + Matching Layer, BiMPM) for the paraphrase identification task, specifically with the Quora Question Pairs dataset.
Neural network toolkit for sentence pair modeling.
Implementation of Siamese Neural Networks built upon multihead attention mechanism for text semantic similarity task.
This is our team's solution report, which achieves top 10% (305/3307) in this competition.
Code for paper title "Learning Semantic Sentence Embeddings using Pair-wise Discriminator" COLING-2018
Large scale sentential paraphrases collection and annotation
Heterogenous, Task- and Domain-Specific Benchmark for Unsupervised Sentence Embeddings used in the TSDAE paper: https://arxiv.org/abs/2104.06979.
Variants of Multi-Perspective Convolutional Neural Networks
Paraphrase question identification using Feature Fusion Network (FFN).
Paraphrase Generation Using Deep Reinforcement Learning - MSc Thesis
Source code for SDM 2020 paper "What Do Questions Exactly Ask? MFAE: Duplicate Question Identification with Multi-Fusion Asking Emphasis"
Paraphrase Identification with Deep Learning using Keras
Matching The Statements: A Simple and Accurate Model for Key Point Analysis (ArgMining | EMNLP 2021)
Paraphase Generation
Official Repository for the paper titled "Meta-Learning for Effective Multi-task and Multilingual Modelling" accepted at EACL 2021
Large Scale Multilingual Paraphrase Corpus
Project of Paraphrase Identification Based on Weighted URAE, Unit Similarity and Context Correlation Feature
Paraphrase Detection applied to Medical domain
Materials for the RAAI Summer School 2019 workshop
This is the official repository of the paper titled "BnPC: A Gold Standard Corpus for Paraphrase Detection in Bangla, and its Evaluation", accepted in The 17th Workshop on Building and Using Comparable Corpora (BUCC 2024) co-located with LREC-COLING 2024. It contains the codes and the dataset.
I built this to automate discovery of common text between two documents. I used porter stemming, windowing, and make it save to file. I built the GUI in Java. This project was so successful that a competitor quickly added similar features and Dr. Hilton III published papers on results he discovered when using it.
The official repository for the paper "Paraphrase Detection: Human vs. Machine Content".
Deep Learning NLP Library
This repo contains models by the team of @360er0 and @tomekkorbak as they participated in Quora question pairs Kaggle contest.
Paraphrase Identification using WordNet
Check matching seven word sequences in two different pieces of text.
Implementing various measures of paraphrase detection on Microsoft Paraphrase Corpus and checking their performance on original high dimension TF-IDF matrix and it's low dimension approximation