Use the pretrained model to predict 16 kHz track

Question

Use the pretrained model to predict 16 kHz track

ruizhecao96 opened this issue 2 years ago · comments

Hello,

Your work is awesome! Can I use the provided 48 kHz pretrained model to predict the 16 kHz track directly? e.g. 4 kHz to 16 kHz. Or do I need to retrain the model?

Best regards

Haohe (Leo) Liu / 刘濠赫 · Answer 1 · Sat May 14 2022 19:23:06 GMT+0800 (China Standard Time)

@ruizhecao96 Thanks for your interest! Sure, you can predict the 16kHz track. The target sampling rate of the pre-trained NVSR model is 44.1kHz. You can first resample your input to 44.1kHz. Then use our model to perform super-resolution. And finally resample it back to 16kHz.

Ruizhe Cao · Answer 2 · Sat May 14 2022 20:12:29 GMT+0800 (China Standard Time)

@ruizhecao96 Thanks for your interest! Sure, you can predict the 16kHz track. The target sampling rate of the pre-trained NVSR model is 44.1kHz. You can first resample your input to 44.1kHz. Then use our model to perform super-resolution. And finally resample it back to 16kHz.

I am not sure if I get the point. For example, I have a 4 kHz audio and I want to upsample it to 16 kHz using pretrained NVSR. Do you mean I first upsample it to 44.1 kHz using NVSR and then resample it to 16 kHz?

Haohe (Leo) Liu / 刘濠赫 · Answer 3 · Sat May 14 2022 20:54:10 GMT+0800 (China Standard Time)

Do you mean I first upsample it to 44.1 kHz using NVSR and then resample it to 16 kHz?

Yes, that's right.

Ruizhe Cao · Answer 4 · Sat May 14 2022 23:14:32 GMT+0800 (China Standard Time)

Thank you very much!