Jackthebighead / duplicate-question-pair-identification

Three models are implemented for text similarity classification/STS problem on Quora Question Pairs dataset.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

This repo records the term project for Machine Learning course.

This repository contains model solutions for duplicate question(sentence) pair classification on QQP dataset, which solves the problem of Semantic Text Simmilarity in NLP. This is a continuous updating repository. Newly updates are shown before the documentation of the projects.

Newly update (2021.1.15)

  • implemented Enhanced RCNN
  • model stacking, tbc..

Update (2020.12.21)

  • sentence BERT(Siamese BERT) is tried.
  • ESIM is tried

About

Three models are implemented for text similarity classification/STS problem on Quora Question Pairs dataset.