Extreme Classification for Answer Type Prediction in Question Answering

Citation

If you are using the contents of this repository please cite it this way:

@article{Setty:2023:Arxiv,
  title={Extreme Classification for Answer Type Prediction in Question Answering},
  author={Setty, Vinay},
  journal={arXiv preprint arXiv:2304.12395},
  year={2023}
}

Abstract

Semantic answer type prediction (SMART) is known to be a useful step towards designing effective question answering (QA) systems. The SMART task involves predicting the top-$k$ knowledge graph (KG) types for a given natural language question. This is challenging due to the large number of types in KGs. In this paper, we propose use of extreme multi-label classification using Transformer models (XBERT) by clustering KG types using structural and semantic features based on question text. We specifically improve the clustering stage of the XBERT pipleline using textual and structural features derived from KGs. We show that these features can improve end-to-end performance for the SMART task, and yield state-of-the-art results.

Installation

Create a conda environment conda create -n jcdl-smart-2023
Install the requirements pip install -r requirements.txt
Install apex a. Clone apex repository b. install with command pip install -v --disable-pip-version-check --no-cache-dir ./

Usage

On a GPU server run python -m smart.classification.category.bert_category_classifier
Modify the bash script scripts/run_experiments.sh to specify appropriate datasets, embedding types and Transformer models.

EMB_LIST=( pifa-neural text-emb pifa-tfidf )
dataset_list=( wikidata dbpedia )
model_list=( roberta xlnet )

Then run ./scripts/run_experiments.sh
It should run full pipeline with feature generation to model training, ranking, prediction and evaluation using the script. It should print NDCG for DBPedia and MRR for Wikidata.
To run with a sample data and for pytest unit tests to pass run ./scripts/run_experiments.sh True The True flag runs it in test mode.

Results

Method	DBpedia Type Classification			DBpedia End-to-End			Wikidata Type Classification	Wikidata End-to-End
	NDCG@3	NDCG@5	NDCG@10	NDCG@3	NDCG@5	NDCG@10	MRR	MRR
Question text (TF-IDF)	0.717	0.693	0.650	0.824	0.811	0.787	0.66	0.76
KG-TypeSim	0.725	0.697	0.662	0.828	0.813	0.793	0.67	0.77
KG-RDF2Vec	0.729	0.701	0.656	0.831	0.815	0.791	0.67	0.78
BERT-TypeDesc	0.727	0.706	0.665	0.830	0.818	0.795	0.67	0.78
BERT-TypeDesc-FT	0.734	0.712	0.678	0.834	0.822	0.802	0.68	0.79

About

Languages

Language:Python 64.6%Language:C++ 30.1%Language:Shell 4.8%Language:Makefile 0.4%