Langboat / mengzi-retrieval-lm

An experimental implementation of the retrieval-enhanced language model

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

prepare_load.py dosen't filter the len(input_ids) < chunk_size data like dataset.py

dumpmemory opened this issue · comments

I found that the logic in prepare_load.py is different from dataset.py. prepare_load didn't filter the data which len(input_ids) < chunk_size like

if len(input_ids) >= chunk_size:
. which one should i follow.