ZeweiChu / MQR

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

MQR

The is the repository for the paper How to Ask Better Questions? A Large-Scale Multi-Domain Dataset for Rewriting Ill-Formed Questions

Train Data

Dev/Test Data and Model Predictions

  • The dev and test datasets are under directory data
  • The rewritten dev/test splits can be found under each subdirectory of data

Annotation

License

The MQR dataset is under cc-by-sa 4.0 license, intended to be shared and remixed.

The MQR dataset is partially constructed from the Stack Exchange data dumps

We used Quora Question Pairs dataset as part of the training data

We also used the Paralex dataset for training

Reference

@inproceedings{chu-mqr-20,
  author    = {Zewei Chu and Mingda Chen and Jing Chen and Miaosen Wang and Kevin Gimpel and Manaal Faruqui and Xiance Si},
  title     = {How to Ask Better Questions? A Large-Scale Multi-Domain Dataset for Rewriting Ill-Formed Questions},
  booktitle = {Proc. of {AAAI}},
  year      = {2020}
}

About