ChewKokWah / unifiedqa

UnifiedQA: Crossing Format Boundaries With a Single QA System

Home Page:https://arxiv.org/abs/2005.00700

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

UnifiedQA

You may want to check out:

Released Model Checkpoints

If you intend to create a QA system, you can use our QA-specialized models for your purpose:

T5 models

Note: In the experiments reported in our paper we always used the checkpoint closest to 100k steps (it usually corresponds to checkpoint 1100500)

You can use these in two ways:

  • If you don't have any training data, you can use them for the evaluation.
  • If you training data, you can use them as your initial models and fine-tune on them.

For more details see the T5 repository.

BART models

The uncased models uslaly gave us better and more robust results.

The datasets/tasks used in the experiments

While the datasets we used are all public, it could be a bit time-confusing to convert them all into text-to-text format. We're releasing the already proccessed text-to-text datasets: [To be updated]

Prediction files

We're making the predictions of the many of our models available. [To be updated]

How to cite

If you extend or use this work, please cite the paper:

@article{2020unifiedqa,
    title={UnifiedQA: Crossing Format Boundaries With a Single QA System},
    author={D. Khashabi and S. Min and T. Khot and A. Sabhwaral and O. Tafjord and P. Clark and H. Hajishirzi},
    journal={arXiv preprint},
    year={2020}
}

About

UnifiedQA: Crossing Format Boundaries With a Single QA System

https://arxiv.org/abs/2005.00700