config for reproducing the provided model clmr_checkpoint_10000.pt

Question

config for reproducing the provided model clmr_checkpoint_10000.pt

pengbo-learn opened this issue 3 years ago · comments

I trained and evaluated with the default config, which resulted in roc auc 86.3.

Then I find the config is inconsistent with the paper's golden setttings,

                     config.yaml        paper
batch size            48                 96
max epoches          200               1000
sample rate        16000              22050

I changed these params and retrained, got roc auc 87.3, a lot better. Still, it was far from the 88.5 in the paper.

Meanwhile, finetuning linear classifier from clmr_checkpoint_10000.pt did work.

So I wonder if you could provide a config setting for reproducing the pretrained model.

Janne · Answer 1 · Wed Jun 16 2021 16:47:14 GMT+0800 (China Standard Time)

Hi there! From the looks of it, you trained at 16kHz. The paper's best results were indeed reached when training the SSL model at 22.5kHz, for 10.000 epochs. I'll update the config soon!