sentence-classification sentence-representation deep-models pytorch

Deep learning models for sentence representation on classification in PyTorch

This repository contains some popular deep learning models for sentence representation (also applicable to document-level text), built in PyTorch. Intended for learning PyTorch, this repo aims to be understandable for anyone with basic Python and deep learning knowledge. Links to some papers are also given.

Requirements

Python 2.7
PyTorch 0.2
torchtext 0.2

Usage

python train.py -conf [config file]

Choose the config file that sets the datasets and models.
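
As a rough sketch of how the `-conf` flag could be handled, the snippet below parses the flag with `argparse` and reads a config with `configparser`. The INI-style layout and the section and key names (`data`, `model`, `dataset`, `encoder`, `hidden_size`) are assumptions made for illustration; the actual format of trec/trec.conf may differ. It also assumes Python 3, where the module is named `configparser`.

```python
import argparse
import configparser


def parse_args(argv=None):
    # Single-dash long option, matching `python train.py -conf [config file]`.
    parser = argparse.ArgumentParser(description="Train a sentence classifier.")
    parser.add_argument("-conf", required=True, help="path to the config file")
    return parser.parse_args(argv)


def load_config(text):
    # Hypothetical INI-style layout; the repo's real config format may differ.
    config = configparser.ConfigParser()
    config.read_string(text)
    return config


# Hypothetical config contents, for illustration only.
EXAMPLE_CONF = """
[data]
dataset = TREC

[model]
encoder = LSTM
hidden_size = 300
"""

args = parse_args(["-conf", "trec/trec.conf"])
config = load_config(EXAMPLE_CONF)
```

In a real script the text would come from the file named by `args.conf` rather than from an inline string.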

Folder Structure

model file:
- model/model.py, contains the deep models for sentence representation.
training framework:
- train.py, preprocesses the data and trains the model.
configuration files:
- e.g. trec/trec.conf, the config file used to set the datasets and models.
helper functions:
- utils/utils.py, some helper functions.

Models [IN PROGRESS]

For now, the models listed below have been added to this repo. Some benchmarks for these models are also given (the hyper-parameters are far from optimal, so the performance of these models can be improved with careful tuning).

Model	TREC6-valid¹	TREC6-test	SST2-valid²	SST2-test
LSTM	-	94.6	84.98	85.45
Bi-LSTM	-	94.4	85.21	86.44
CNN	-	95.2	84.63	84.73
SelfAttn	-	96.0	85.44	86.66
BCN+CoVe	-	95.0	87.55	87.84

1: The best accuracy on the test set is reported, since TREC6 has no development set.

2: Only the sentence-level training samples are used.
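
To give a feel for what a sentence encoder of this kind looks like, here is a minimal Bi-LSTM classifier sketch. It is not the code in model/model.py: the class name, the layer sizes, and the max-pooling over time steps are assumptions chosen for illustration, and it is written for Python 3 with a recent PyTorch release rather than the versions listed under the requirements.

```python
import torch
import torch.nn as nn


class BiLSTMClassifier(nn.Module):
    """Hypothetical Bi-LSTM sentence classifier (illustrative only)."""

    def __init__(self, vocab_size, embed_dim, hidden_dim, num_classes):
        super().__init__()
        # Index 0 is reserved for padding in this sketch.
        self.embedding = nn.Embedding(vocab_size, embed_dim, padding_idx=0)
        self.lstm = nn.LSTM(embed_dim, hidden_dim,
                            batch_first=True, bidirectional=True)
        # Forward and backward states are concatenated, hence 2 * hidden_dim.
        self.classifier = nn.Linear(2 * hidden_dim, num_classes)

    def forward(self, token_ids):
        # token_ids: (batch, seq_len) integer word indices
        embedded = self.embedding(token_ids)    # (batch, seq_len, embed_dim)
        outputs, _ = self.lstm(embedded)        # (batch, seq_len, 2 * hidden_dim)
        # Max-pool over time to get a fixed-size sentence vector.
        # Padded positions are not masked in this sketch.
        sentence_vec, _ = outputs.max(dim=1)    # (batch, 2 * hidden_dim)
        return self.classifier(sentence_vec)    # (batch, num_classes)


# Six output classes, as in TREC6; the other sizes are arbitrary.
model = BiLSTMClassifier(vocab_size=100, embed_dim=16, hidden_dim=32, num_classes=6)
batch = torch.randint(1, 100, (4, 10))  # 4 sentences of 10 token ids each
logits = model(batch)
```

The resulting `logits` tensor has one row per sentence and one column per class, and can be passed to `nn.CrossEntropyLoss` for training.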

MIT License
