A model that classifies argumentative elements such as Lead, Position, Claim, Counterclaim, Rebuttal, Evidence, and Concluding Statement as "effective," "adequate," or "ineffective," based on essays written by U.S. students in grades 6-12.
We recommend using a conda environment to install this library's dependencies. Please install (or load) conda, then run the following commands:
```shell
conda create -n dl_project python=3.9
conda activate dl_project
conda install pytorch torchvision torchaudio -c pytorch
conda install -c pytorch torchtext
conda install matplotlib
pip install pyyaml
pip install transformers
```
Sometimes torchtext raises errors such as

```
ModuleNotFoundError: No module named 'torchtext.legacy'
```

In this case, try downgrading torchtext:

```shell
pip install torchtext==0.10.0
```
Next, please install spaCy, since it is our default tokenizer:
```shell
pip install -U pip setuptools wheel
pip install -U spacy
python -m spacy download en_core_web_sm
```
Or, if you are using ARM/M1, run this:
```shell
pip install -U pip setuptools wheel
pip install -U 'spacy[apple]'
python -m spacy download en_core_web_sm
```
- Inside the code folder
File | Description |
---|---|
preprocessing.py | Splits and wraps datasets into data iterators using Torchtext |
rnn.py | Includes an RNN model and a fine-tuning routine that trains/evaluates this RNN model |
models.py | Defines the RNN, LSTM, and GRU |
config.yml | Sets the hyperparameters for loading data, initializing models, and training |
train.py | Defines functions that train the model, plot loss/accuracy for the train/validation datasets, and run inference |
run.py | Trains the model and plots loss and accuracy |
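As a rough illustration of the kinds of hyperparameters config.yml controls, a sketch might look like the following. All key names and values here are assumptions for illustration, not the actual file contents:

```yaml
# Hypothetical sketch of config.yml -- the real keys may differ.
data:
  batch_size: 64
  max_vocab_size: 25000
model:
  type: lstm          # rnn | lstm | gru | bert
  embedding_size: 100
  hidden_size: 256
  bidirectional: true
  last_hidden: true   # use final hidden state(s) vs. mean pooling
train:
  epochs: 10
  learning_rate: 0.001
```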
Model | Bidirectional | Last Hidden | Loss on train | Loss on validation |
---|---|---|---|---|
RNN | True | True | 0.891 | 0.909 |
RNN | True | False | 0.817 | 0.897 |
RNN | False | True | 0.946 | 0.938 |
RNN | False | False | 0.891 | 0.901 |
LSTM | True | True | 0.891 | 0.899 |
LSTM | True | False | 0.804 | 0.895 |
LSTM | False | True | 0.946 | 0.938 |
LSTM | False | False | 0.836 | 0.878 |
GRU | True | True | 0.805 | 0.901 |
GRU | True | False | 0.808 | 0.894 |
GRU | False | True | 0.829 | 0.879 |
GRU | False | False | 0.825 | 0.888 |
BERT | NA | NA | 0.974 | 0.971 |
Model | Bidirectional | Last Hidden | Loss on train | Loss on validation |
---|---|---|---|---|
RNN | True | True | 0.917 | 0.929 |
RNN | True | False | 0.852 | 0.883 |
RNN | False | True | 0.903 | 0.903 |
RNN | False | False | 0.874 | 0.891 |
LSTM | True | True | 0.854 | 0.888 |
LSTM | True | False | 0.842 | 0.880 |
LSTM | False | True | 0.869 | 0.869 |
LSTM | False | False | 0.829 | 0.892 |
GRU | True | True | 0.832 | 0.880 |
GRU | True | False | 0.815 | 0.884 |
GRU | False | True | 0.833 | 0.885 |
GRU | False | False | 0.831 | 0.893 |
BERT | NA | NA | 0.910 | 0.919 |
Model layers | input | output | notes |
---|---|---|---|
Embedding layer | (batch_size, sequence_length) | (batch_size, sequence_length, embedding_size) | Encode words as word embedding by using pretrained text vectors |
RNN layer | PackedSequence | (sequence_length, batch_size, D * hidden_size) and (D * num_layers, batch_size, hidden_size) | Uses the final forward and backward hidden states as the RNN output, or the average pooling of all hidden states |
Fully connected layer | (batch_size, D * hidden_size) | (batch_size, 3) | Combines the RNN result and maps it to the output size |
Softmax layer | (batch_size, 3) | (batch_size, 3) | Softmax output for our text classification problem |
In this section, to get the classification result, we used a many-to-one RNN model that processes the whole sentence and produces the result.
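The layer table above can be sketched in PyTorch roughly as follows. This is a minimal illustration, not the actual models.py: the vocabulary, embedding, and hidden sizes are placeholder values, and the real implementation feeds a PackedSequence and loads pretrained text vectors rather than training the embedding from scratch.

```python
import torch
import torch.nn as nn

class RNNClassifier(nn.Module):
    """Many-to-one RNN classifier: embedding -> RNN -> linear -> softmax."""

    def __init__(self, vocab_size=1000, embedding_size=100,
                 hidden_size=64, num_classes=3, bidirectional=True):
        super().__init__()
        self.embedding = nn.Embedding(vocab_size, embedding_size)
        self.rnn = nn.RNN(embedding_size, hidden_size,
                          bidirectional=bidirectional, batch_first=True)
        d = 2 if bidirectional else 1          # D in the table above
        self.fc = nn.Linear(d * hidden_size, num_classes)

    def forward(self, tokens):
        # (batch_size, sequence_length) -> (batch_size, sequence_length, embedding_size)
        embedded = self.embedding(tokens)
        # outputs: (batch_size, sequence_length, D * hidden_size)
        # hidden:  (D * num_layers, batch_size, hidden_size)
        outputs, hidden = self.rnn(embedded)
        # "Last Hidden" variant: concatenate the final forward and backward
        # states; the alternative is average pooling over all hidden states.
        if self.rnn.bidirectional:
            last = torch.cat([hidden[-2], hidden[-1]], dim=1)
        else:
            last = hidden[-1]
        logits = self.fc(last)                 # (batch_size, 3)
        return torch.softmax(logits, dim=1)    # class probabilities

model = RNNClassifier()
probs = model(torch.randint(0, 1000, (4, 12)))  # batch of 4 sequences, length 12
print(probs.shape)  # torch.Size([4, 3])
```

Swapping `nn.RNN` for `nn.LSTM` or `nn.GRU` gives the other two variants in the results tables (for `nn.LSTM`, the second return value is a `(hidden, cell)` tuple).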
- Model parameters