p-lambda / dsir

DSIR: a large-scale data selection framework for language model training

Home Page: https://arxiv.org/abs/2302.03169

Request for weights of pretrained model

mmkytuzszd71 opened this issue

Great work!

Could you provide the weights of BERT-style masked language models pretrained on the selected data?

I've pushed links to all the relevant pretrained models in the README.