tbmihailov / OBQA

open book question answering

OBQA

open book question answering

Data Set

OBQA Link to Dataset

Parsing and Tagging of Wh Questions

Parser and Tagger are taken from https://github.com/tdozat/Parser-v3.
Parser and Tagger are trained using CoNLL 2018 dataset :
- Git clone the repo. Create the Data Directory : data/CoNLL18/UD_English-EWT. Save both the Datasets and the embeddings.
- Training Data : http://universaldependencies.org/conll18/
- Word2Vec Embeddings : https://lindat.mff.cuni.cz/repository/xmlui/handle/11234/1-1989 . Only English Word2Vec embeddings are needed.
- Environment : TensorFlow=1.4, Scipy, Matplotlib, Psutil, Python=3.6, Pandas, Conllu
- Training Command : python main.py train ParserNetwork / TaggerNetwork
- Run Model Command : python main.py --save_dir=\$PATH_TO_NETWORK run \$INPUTFILE --output_dir=\$OUTPUTDIR
- Key Point to Note: CoNLLU format needs to be adhered strictly, Tabs between columns.
- Trained Models to be pushed at a location : [DropboxLocation]

Knowledge selection from OpenBook facts

Present in folder ir, with its own ReadMe

Word Intersection, Union and Seq2Seq Abductive IR

Present in notebooks and folders

Re-Ranking using SpaCY

Present in filterir

BERT QA models

Runner and scorer are present

About

open book question answering

Apache License 2.0

Languages

Language:Jupyter Notebook 72.3%Language:Python 27.7%Language:Shell 0.0%