facebookresearch / DrQA

Reading Wikipedia to Answer Open-Domain Questions

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Bias In Retriever towards Long Texts

kuldeep7688 opened this issue · comments

I tried the retriever module for getting documents related to the question but unfortunately almost everytime long documents were suggested as the best matched.

I tried to find out whether the tf vector is normalized in the compressed sparse matrix creation but couldn't.

Can someone help me whether I am right or wrong ?
And has anyone noticed this ?