Lu Zhang's repositories
Microphone-Array-Simulation-Environment
MATLAB code that simulates a uniform linear microphone array. The room size, degree of reverberation, and the number and location of microphones are configurable, which reduces dependence on a hardware experimental setup to a certain extent.
AAS_enhancement
This repository contains the code and supplementary results for the paper "Unpaired Speech Enhancement by Acoustic and Adversarial Supervision".
audio_visual_speech_enhancement
Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments
ava-dataset
The AVA dataset densely annotates 80 atomic visual actions in 351k movie clips with actions localized in space and time, resulting in 1.65M action labels with multiple labels per human occurring frequently.
conv-tasnet
A PyTorch implementation of "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation"
Deep-Learning-for-Speech-Enhancement
Remove noise from sound clips using supervised training and an ideal ratio mask.
DeepXi
Deep Xi: A Deep Learning Approach to A Priori SNR Estimation. Used for Speech Enhancement and robust ASR.
dl_signal
Deep Learning Model for Signal Data
IRM-based-Speech-Enhancement-using-DNN
Ideal Ratio Mask (IRM) estimation based Speech Enhancement using DNN
LPCNet
Efficient neural speech synthesis
MAIR-Library-and-Renderer
MAIR is an open-access library of an extensive set of room impulse responses (RIRs) captured using a total of 40+ microphone techniques from 2ch stereo to 9ch 3D as well as those for 360° audio capture. The library is provided with a rendering tool that enables the user to create virtual recordings for both loudspeaker and binaural playback.
noise_adaptive_DAT_SE
Noise Adaptive Speech Enhancement using Domain Adversarial Training
noisereduce
Noise reduction / speech enhancement for Python using spectral gating
norbert
Painless Wiener filters for audio separation
onssen
An open-source speech separation and enhancement library
Perceptual-Weighting-Filter-Loss
A perceptual weighting filter loss for DNN training in speech enhancement
Seq-U-Net
Official implementation of the Seq-U-Net for efficient sequence modelling
SNR-Based-Progressive-Learning-of-Deep-Neural-Network-for-Speech-Enhancement
Implementation of the paper "SNR-Based Progressive Learning of Deep Neural Network for Speech Enhancement".
speaker_extraction
Target speaker extraction and verification for multi-talker speech
speech-denoising-wavenet
A neural network for end-to-end speech denoising
Speech_Enhancement_DNN_NMF
Speech Enhancement based on DNN (Spectral-Mapping, TF-Masking), DNN-NMF, NMF
SpeechDenoisingWithDeepFeatureLosses
Speech Denoising with Deep Feature Losses
TensorFlow-speech-enhancement-Chinese
Deep learning-based speech enhancement and dereverberation
Wave-U-Net
Implementation of the Wave-U-Net for audio source separation
Wave-U-Net-For-Speech-Enhancement
Improved speech enhancement with the Wave-U-Net, a deep convolutional neural network architecture for audio source separation, implemented for the task of speech enhancement in the time-domain.
wavelet-denoising
Speech enhancement based on adaptive wavelet denoising on multitaper spectrum
WebRTC_NS
Noise Suppression Module Port From WebRTC