haoheliu / ssr_eval

Evaluation and Benchmarking of Speech Super-resolution Methods

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Use the pretrained model to predict 16 kHz track

ruizhecao96 opened this issue · comments

Hello,

Your work is awesome! Can I use the provided 48 kHz pretrained model to predict the 16 kHz track directly? e.g. 4 kHz to 16 kHz. Or do I need to retrain the model?

Best regards

@ruizhecao96 Thanks for your interest! Sure, you can predict the 16kHz track. The target sampling rate of the pre-trained NVSR model is 44.1kHz. You can first resample your input to 44.1kHz. Then use our model to perform super-resolution. And finally resample it back to 16kHz.

@ruizhecao96 Thanks for your interest! Sure, you can predict the 16kHz track. The target sampling rate of the pre-trained NVSR model is 44.1kHz. You can first resample your input to 44.1kHz. Then use our model to perform super-resolution. And finally resample it back to 16kHz.

I am not sure if I get the point. For example, I have a 4 kHz audio and I want to upsample it to 16 kHz using pretrained NVSR. Do you mean I first upsample it to 44.1 kHz using NVSR and then resample it to 16 kHz?

Do you mean I first upsample it to 44.1 kHz using NVSR and then resample it to 16 kHz?

Yes, that's right.

Thank you very much!