castorini / pyserini

Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.

Home Page:http://pyserini.io/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

In Splade example for MS Marco evaluation why index 8.8M train passages and evaluate wiht 6980 queries from dev ?

PrithivirajDamodaran opened this issue · comments

I am referring to the docs page titled "Pyserini: SPLADEv2 on MS MARCO V1 Passage Ranking"

Please advice.