Wrong output shape in enhancement.py

Question

PavelPanjaya opened this issue 4 months ago

The shape of the output tensor x_hat is torch.Size([2, 758949]), which can't be written as an audio file because of the first dimension.

Simon Welker · Answer 1 · Sun Apr 21 2024 19:16:22 GMT+0800 (China Standard Time)

The model is designed for single-channel speech enhancement; you may be providing a stereo input file?
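If the input does turn out to be stereo, one common way to downmix it is to average the two channels before enhancement. The following is only a rough sketch, not code from enhancement.py: the helper name to_mono is hypothetical, and it assumes a floating-point tensor laid out as (channels, samples), which matches the [2, 758949] shape reported above.

```python
import torch


def to_mono(waveform: torch.Tensor) -> torch.Tensor:
    """Downmix a (channels, samples) float tensor to shape (1, samples).

    Hypothetical helper for illustration; not part of the repository.
    """
    if waveform.dim() == 1:
        # A bare 1-D signal only needs a channel axis added.
        return waveform.unsqueeze(0)
    # Average across the channel axis and keep it as a size-1 dimension.
    return waveform.mean(dim=0, keepdim=True)


# Random stand-in for a loaded stereo file with the shape from the report.
stereo = torch.randn(2, 758949)
mono = to_mono(stereo)
print(mono.shape)  # torch.Size([1, 758949])
```

Averaging is just one downmix choice; keeping only the first channel (waveform[:1]) would also give a single-channel input.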

PavelPanjaya · Answer 2 · Sun Apr 21 2024 19:33:29 GMT+0800 (China Standard Time)

Thanks for your response,
I've converted my audio to mono and it works now.
Thanks.

Simon Welker · Answer 3 · Sun Apr 21 2024 19:36:04 GMT+0800 (China Standard Time)

Great! Closing this issue.