facebookresearch / denoiser

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Very distorted output

MohammedMehdiTBER opened this issue · comments

I am using these settings for separation, yet I only get goods results from track recorded in studio not the tracks that are recorded using legacy mics or phone recorder even though the voice is clear.

python -m denoiser.enhance --device cpu --master64 --dry 0 --noisy_dir "C:\Users\MohammedMehdiTBER\Music\wav" --out_dir "C:\Users\MohammedMehdiTBER\Music\wavout"

--streaming is used for long tracks, but I am not sure if It makes the quality worse. The option cannot be used with --valentini_nc