AIED_2021_TRMRC_code

This repo contains our code for the experiments in the paper: Automatic Task Requirements Writing Evaluation With Feedback via Machine Reading Comprehension.

We explored several advanced MRC methods, such as BERT, BART, RoBERTa, and SAN, to address the automatic response locating problem in task requirement questions.

We use the SAN code from this work and the Retro-Reader code from this work.

Requirements

We have only tested our code on Python 3.6.8.

  • Transformers

We use version 3.3.1 of Transformers: https://github.com/huggingface/transformers

 pip install transformers==3.3.1
  • Spacy

We use version 2.2.4 of spaCy and its small English model en_core_web_sm. A quick sanity check of both installations is sketched after this list.

 pip install spacy==2.2.4 
 python -m spacy download en 
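
As an optional sanity check (our suggestion, not a script shipped in this repo), you can verify the pinned versions and the spaCy model from Python:

 import spacy
 import transformers

 # Versions pinned above.
 assert transformers.__version__ == "3.3.1"
 assert spacy.__version__ == "2.2.4"

 # "spacy download en" installs en_core_web_sm and links it as "en".
 nlp = spacy.load("en_core_web_sm")
 print(nlp("Sanity check sentence.")[0].text)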

Datasets

  • SQuAD 2.0

    The second version of the Stanford Question Answering Dataset: https://www.aclweb.org/anthology/P18-2124

  • SED

    Student Essay Dataset: an educational-domain dataset with the same structure as SQuAD 2.0 (see the loading sketch after this list).

  • SQuAD 2.0 & SED

    A combined dataset consisting of SED and SQuAD 2.0.
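
For reference, the sketch below (not part of the repo's scripts) shows how a SQuAD-2.0-style file such as ./data/SED/test.json can be read in Python; the field names follow the standard SQuAD 2.0 schema, which SED shares:

 import json

 # Load a SQuAD-2.0-style dataset file; the path matches the
 # evaluation command shown below.
 with open("./data/SED/test.json", encoding="utf-8") as f:
     dataset = json.load(f)

 # Standard SQuAD 2.0 nesting: data -> paragraphs -> qas.
 for article in dataset["data"]:
     for paragraph in article["paragraphs"]:
         context = paragraph["context"]
         for qa in paragraph["qas"]:
             question = qa["question"]
             unanswerable = qa.get("is_impossible", False)  # SQuAD 2.0 flag
             answers = [a["text"] for a in qa["answers"]]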

Training and Evaluation

Training BERT, BART, RoBERTa, and ALBERT

Run train.sh in the directory named after each model. Taking BERT as an example, you can fine-tune a BERT model on specific data by running the command below:

sh ./bert/train.sh

After training, the fine-tuned model can be found at ./bert/models/.
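
Assuming train.sh saves the model in the standard Transformers format (via save_pretrained), it can be reloaded like this (illustrative sketch, not a repo script):

 from transformers import AutoModelForQuestionAnswering, AutoTokenizer

 # Reload the fine-tuned extractive-QA model from the output
 # directory named above.
 model = AutoModelForQuestionAnswering.from_pretrained("./bert/models/")
 tokenizer = AutoTokenizer.from_pretrained("./bert/models/")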

Evaluating BERT, BART, RoBERTa, and ALBERT

Taking BERT as an example, run the following command:

 python ./bert/bert_performance_test.py -f ./data/SED/test.json

By default, the above command evaluates the model stored in the ./bert/models directory.
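
For reference, the sketch below shows generic extractive-QA span decoding with a model loaded as above. It illustrates the response-locating step, but it is not the repo's exact evaluation logic, and the question/context strings are made up:

 import torch
 from transformers import AutoModelForQuestionAnswering, AutoTokenizer

 model = AutoModelForQuestionAnswering.from_pretrained("./bert/models/")
 tokenizer = AutoTokenizer.from_pretrained("./bert/models/")
 model.eval()

 # Hypothetical task-requirement question and essay context.
 question = "How many words should the essay contain?"
 context = "Task: write about your hometown in at least 120 words."

 inputs = tokenizer(question, context, return_tensors="pt")
 with torch.no_grad():
     # transformers 3.x models return tuples: (start_logits, end_logits, ...)
     start_logits, end_logits = model(**inputs)[:2]

 # Take the most likely answer span and decode it back to text.
 start = int(torch.argmax(start_logits))
 end = int(torch.argmax(end_logits))
 answer = tokenizer.decode(inputs["input_ids"][0][start : end + 1].tolist())
 print(answer)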

Training and evaluating SAN and Ours

See the README.md files in ./SAN and ./ours.
