yetingqiaqia / L2R-Software-TR-QAs

This is the dataset for Experiments of "Learning to Rank for Question Oriented Software Text Retrieval"

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

L2R-Software-TR-QAs

This is the dataset for Experiments of "Learning to Rank for Question Oriented Software Text Retrieval"

We provide three datasets here, including the Lucene Tag dataset, Java Tag dataset and 7 projects' FAQs.

Each question is stored in a file, so as to each answer.

For the Lucene Tag dataset, Java Tag dataset, the questions and answers are named as follows:
(1) the file name of question is “QuestionId_Q.txt”
(2) the file name of answer is "AnswerId_QuestionId_A.txt"

For FAQs, the questions and answers are named as follows:
(1) the file name of question is "ProjectName-Q"+"QAId"+".txt"
(2) the file name of answer is "ProjectName-A"+"QAId"+".txt"
(3) the classification result is in files named "InterrogativeNames-Q.txt"

Thanks.

Any questions, please email me: "yeting@pku.edu.cn"

Ting Ye
20150516

About

This is the dataset for Experiments of "Learning to Rank for Question Oriented Software Text Retrieval"