facebookresearch / denoiser

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model that works on the raw waveform and runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized in both the time and frequency domains, using multiple loss functions. Empirical evidence shows that it can remove various kinds of background noise, both stationary and non-stationary, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly to the raw waveform, which further improve model performance and generalization.
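
The word "causal" in the description is what matters for real-time use: in a causal system, each output sample depends only on the current and past input samples, never on future ones. The following is a minimal plain-Python illustration of that property using a simple 1-D filter. It is not the repository's model or code, and `causal_conv1d` is a hypothetical helper name chosen for this sketch.

```python
def causal_conv1d(signal, kernel):
    """Causal FIR filter: y[t] = sum_k kernel[k] * signal[t - k].

    Samples before t = 0 are treated as zero, so y[t] never reads
    signal[t + 1] or anything later.
    """
    out = []
    for t in range(len(signal)):
        acc = 0.0
        for k, weight in enumerate(kernel):
            if t - k >= 0:  # only current and past samples contribute
                acc += weight * signal[t - k]
        out.append(acc)
    return out


if __name__ == "__main__":
    print(causal_conv1d([1, 2, 3, 4], [0.5, 0.5]))  # [0.5, 1.5, 2.5, 3.5]
```

Because the output at time `t` ignores everything after `t`, changing later input samples leaves earlier outputs untouched, which is why a causal model can emit audio as the input arrives.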

How Can I use this code in real time?

rltgjqmthf opened this issue

Hi. Nice code! Thank you for sharing.

I ran this code in PyCharm with PyTorch to denoise my voice, but the output doesn't sound smooth and my voice is delayed a lot, possibly because of my CPU. So I am now trying to use FPGA hardware to make it work better.
I am a newbie in programming, so I haven't fully understood the algorithm yet, and I'm now having trouble making diagrams. Can you explain what I should do to make this code run faster? Thanks a lot.
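
One way to check whether the CPU really is the bottleneck is to measure the real-time factor (RTF): the processing time divided by the duration of the audio processed. An RTF below 1 means the processing keeps up with the incoming audio, while an RTF above 1 means delay will keep growing. The sketch below uses only the Python standard library. `real_time_factor` and `identity` are hypothetical names, `identity` is a stand-in for the actual enhancement call, and the 16 kHz sample rate and chunk size are assumptions for illustration only.

```python
import time


def real_time_factor(process_chunk, chunks, sample_rate):
    """Return processing time divided by the duration of the audio in `chunks`."""
    total_samples = 0
    start = time.perf_counter()
    for chunk in chunks:
        process_chunk(chunk)  # the call whose speed is being measured
        total_samples += len(chunk)
    elapsed = time.perf_counter() - start
    if total_samples == 0:
        raise ValueError("no audio samples were processed")
    audio_seconds = total_samples / sample_rate
    return elapsed / audio_seconds


def identity(chunk):
    # Placeholder: replace with the real per-chunk denoising call.
    return chunk


if __name__ == "__main__":
    # Ten chunks of 1600 samples: one second of audio at an assumed 16 kHz.
    chunks = [[0.0] * 1600 for _ in range(10)]
    rtf = real_time_factor(identity, chunks, sample_rate=16000)
    print(f"real-time factor: {rtf:.6f}")
```

If the measured RTF is well below 1 but the audio still lags, the delay more likely comes from buffering or chunk size in the audio input and output path than from raw compute speed.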