I can not reproduce the result that reported in the paper

Question

I can not reproduce the result that reported in the paper

KhanhNguyen4999 opened this issue 2 years ago · comments

I trained model using script launch_valentini.sh on dataset valentini 2017, with spk287 and spk286 in testset (resample from 48k to 16k using sox). But I got pesq=2.62 and stoi=0.92 after 400 epoch, the result is very smaller than paper's report.

In the paper reported that noisy data has pesq=1.97 and stoi=91.5, but I recalculated using code in this repo, I modified in denoiser/evaluate.py line 94(run with cpu). Specifically, if I want to calculate pesq of clean data, in line 94, I replace "estimate" by "clean", otherwise replace by "noisy":

noisy data: pesq=1.5, stoi=0.84
clean data: 4.64, stoi=1

What wrongs in this result? how can I reproduce correctly? Please help me!

Khánh Nguyễn · Answer 1 · Mon Jul 18 2022 22:29:54 GMT+0800 (China Standard Time)

Hope to hear from you soon!, I am still stuck here

Alexandre Défossez · Answer 2 · Tue Jul 19 2022 17:51:58 GMT+0800 (China Standard Time)

Have you tried using the pretrained model on valentini and compute the PESQ and STOI on your dataset ? This would show if there is a mismatch between your data and ours.

Khánh Nguyễn · Answer 3 · Sun Jul 31 2022 00:23:30 GMT+0800 (China Standard Time)

Yes, I did, but pesq and stoi on dns48 pretrain model was very bad. Pesq=2.12 and Stoi=0.89

Alexandre Défossez · Answer 4 · Mon Aug 01 2022 17:09:40 GMT+0800 (China Standard Time)

can you try with the valentini pretrained model

Khánh Nguyễn · Answer 5 · Mon Aug 01 2022 17:47:18 GMT+0800 (China Standard Time)

yep, I have tried master64 pretrain model, and gain pesq=2.69, stoi=0.928. Is there anything wrong?

Alexandre Défossez · Answer 6 · Mon Aug 01 2022 21:43:07 GMT+0800 (China Standard Time)

Can you try with the model --valentini_nc, this one is trained only on valentini.

Khánh Nguyễn · Answer 7 · Tue Aug 02 2022 08:46:02 GMT+0800 (China Standard Time)

I also tried using valentini_nc pretrain model but pesq and stoi didn't change:
-master64: pesq=2.6966, stoi=0.9281
-valentini_nc: pesq=2.6977, stoi=0.9287

with the way I calculate pesq and stoi for the noisy audio in the first comment, let me know how do you calculate pesq and stoi for noisy audio, please? Because I see a gap here

Yossi Adi · Answer 8 · Thu Dec 01 2022 20:38:55 GMT+0800 (China Standard Time)

Hi @KhanhNguyen4999,
This is strange. One reason for the gap might be a change in the valentini dataset. I saw there is a newer version of VCTK dataset, which is the basis of valentini.
However, the drop in performance should not be big and should only observed in the pretrained model (I got 0.94 stoi and 2.91 pesq). When training from scratch I got stoi 95 and 2.95 pesq