fine-tuning sample rate

Question

fine-tuning sample rate

skirdey opened this issue 2 years ago · comments

When using pre-trained models for fine-tuning, shall the fine-tuning training set have a specific sample rate, like 16khz?

Yuan Gong · Answer 1 · Wed Jun 29 2022 03:20:33 GMT+0800 (China Standard Time)

The model is pretrained with only 16kHz data (both AudioSet and Librispeech we use to train the model are re-sampled to 16kHz), so my guess is in the fine-tuning stage, the sampling rate should be consistent. Otherwise you can pretrain the model using a different sampling rate, the pretraining is not that expensive (a few days on 4X1080 GPUs).

-Yuan