sp-uhh / sgmse

Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Wrong output shape in enhancement.py

PavelPanjaya opened this issue · comments

The shape of the output tensor x_hat is torch.Size([2, 758949]) which can't be written as audio file because of the first dimension.

The model is designed for single-channel speech enhancement; you may be providing a stereo input file?

Thanks for your response,
I've converted my audio to mono and it works now.
Thanks.

Great! Closing this issue.