ShahRutav / EMNLP-2021-Findings

This repo has the code for the paper "Question Answering over Electronic Devices: A New Benchmark Dataset and a Multi-Task Learning based QA Framework" accepted at EMNLP 2021 Findings.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Question Answering over Electronic Devices: A New Benchmark Dataset and a Multi-Task Learning based QA Framework

This repo has the code for the paper "Question Answering over Electronic Devices: A New Benchmark Dataset and a Multi-Task Learning based QA Framework" accepted at EMNLP 2021 Findings. The blog on this paper can be found here, the poster here, and a corresponding presentation here.

Required dependencies -

Please run pip install -r requirements.txt (python3 required)

E-Manual pre-training corpus

Go to this link. A RoBERTa BASE Model pre-trained on the corpus can be found here, and a BERT BASE UNCASED Model pre-trained on the same here.

Codes

Baselines

  1. Dense Passage Retrieval(DPR) - Used HuggingFace implementation (https://huggingface.co/transformers/model_doc/dpr.html)
  2. Technical Answer Prediction (TAP) - took the help of code in https://github.com/IBM/techqa
  3. MultiSpan - took the help of code in https://github.com/eladsegal/tag-based-multi-span-extraction

Citation

Please cite the work if you would like to use it.

@article{nandy2021question,
  title={Question Answering over Electronic Devices: A New Benchmark Dataset and a Multi-Task Learning based QA Framework},
  author={Nandy, Abhilash and Sharma, Soumya and Maddhashiya, Shubham and Sachdeva, Kapil and Goyal, Pawan and Ganguly, Niloy},
  journal={arXiv preprint arXiv:2109.05897},
  year={2021}
}

About

This repo has the code for the paper "Question Answering over Electronic Devices: A New Benchmark Dataset and a Multi-Task Learning based QA Framework" accepted at EMNLP 2021 Findings.


Languages

Language:HTML 94.7%Language:Python 4.5%Language:Jupyter Notebook 0.9%