castorini / pyserini

Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.

Home Page: http://pyserini.io/

Pyserini download index doesn't actually appear to check tarball size

lintool opened this issue

The index download currently checks only the MD5 checksum, not the tarball size.
We already store the file size in the dict, so it'd be easy to check that as well.
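
A minimal sketch of what such a check could look like, assuming the expected MD5 and size in bytes are available for each downloaded tarball. The function names `compute_md5` and `verify_download` and their parameters are hypothetical, chosen for illustration; they are not Pyserini's actual API.

```python
import hashlib
import os


def compute_md5(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of the file at `path`, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected_md5, expected_size=None):
    """Raise ValueError if the file's size or MD5 does not match.

    Hypothetical helper: `expected_size` is the size in bytes taken from
    the stored metadata; pass None to skip the size check.
    """
    if expected_size is not None:
        actual_size = os.path.getsize(path)
        if actual_size != expected_size:
            raise ValueError(
                f"Size mismatch for {path}: "
                f"expected {expected_size} bytes, got {actual_size}"
            )
    actual_md5 = compute_md5(path)
    if actual_md5 != expected_md5:
        raise ValueError(
            f"MD5 mismatch for {path}: expected {expected_md5}, got {actual_md5}"
        )
```

Checking the size first is cheap and lets a truncated download fail before the whole file is hashed.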