texttron / tevatron

Tevatron - A flexible toolkit for neural retrieval research and development.

Home Page:http://tevatron.ai

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

How to get the title in msmarco-passage

BeastyZ opened this issue · comments

Hi, @MXueguang
According to the collection in TREC 2019 Deep Learning Track Guidelines, there is no 'title' in corpus. But I see the 'title' in your Tevatron/msmarco-passage. May I know how you get the title?

Hi @BeastyZ , we follow the rocketqa released code/data to create the Tevatron/msmarco-passage, which contains title augmentation.

btw, this paper https://arxiv.org/pdf/2304.12904.pdf have an analysis of with/ with out title

Thank you for your timely help many times.