Windstudent

Lu Zhang's repositories

Microphone-Array-Simulation-Environment

This code can simulate the MATLAB environment of uniform linear microphone array. It can define room size, reverberation degree, the number and location of microphones, and reduce the dependence on hardware experimental environment to a certain extent.

Language:MATLAB29 30

AAS_enhancement

This repository contains the code and supplementary result for the paper "Unpaired Speech Enhancement by Acoustic and Adversarial Supervision".

Language:Python000

audio_visual_speech_enhancement

Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments

Language:Python000

ava-dataset

The AVA dataset densely annotates 80 atomic visual actions in 351k movie clips with actions localized in space and time, resulting in 1.65M action labels with multiple labels per human occurring frequently.

000

conv-tasnet

A PyTorch implementation of "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation"

Language:PythonMIT000

DDAEC

000

Deep-Learning-for-Speech-Enhancement

Remove noise from sound clips by use of supervised training and an ideal ratio mask.

Language:Python000

DeepXi

Deep Xi: A Deep Learning Approach to A Priori SNR Estimation. Used for Speech Enhancement and robust ASR.

MPL-2.0000

denoising_DIHARD18

000

dl_signal

Deep Learning Model for Signal Data

MIT000

IRM-based-Speech-Enhancement-using-DNN

Ideal Ratio Mask (IRM) estimation based Speech Enhancement using DNN

Language:PythonMIT000

Listening_test_Demos

Language:Mathematica000

LPCNet

Efficient neural speech synthesis

Language:CBSD-3-Clause000

MAIR-Library-and-Renderer

MAIR is an open-access library of an extensive set of room impulse responses (RIRs) captured using a total of 40+ microphone techniques from 2ch stereo to 9ch 3D as well as those for 360deg audio capture. The library is provided with a rendering tool that enables the user to create virtual recordings for both loudspeaker and binaural playback.

CC-BY-4.0000

noise_adaptive_DAT_SE

Noise Adaptive Speech Enhancement using Domain Adversarial Training

000

noisereduce

Noise reduction / speech enhancement for python using spectral gating

Language:Jupyter NotebookMIT000

norbert

Painless Wiener filters for audio separation

Language:PythonMIT000

onssen

An open-source speech separation and enhancement library

000

Perceptual-Weighting-Filter-Loss

A perceptual weighting filter loss for DNN training in speech enhancement

Language:MATLAB000

Seq-U-Net

Official implementation of the Seq-U-Net for efficient sequence modelling

MIT000

SNR-Based-Progressive-Learning-of-Deep-Neural-Network-for-Speech-Enhancement

The implementation of the paper SNR-Based Progressive Learning of Deep Neural Network for Speech Enhancement.

Language:PythonMIT000

speaker_extraction

target speaker extraction and verification for multi-talker speech

GPL-3.0000

speech-denoising-wavenet

A neural network for end-to-end speech denoising

MIT000

Speech_Enhancement_DNN_NMF

Speech Enhancement based on DNN (Spectral-Mapping, TF-Masking), DNN-NMF, NMF

Language:Python000

SpeechDenoisingWithDeepFeatureLosses

Speech Denoising with Deep Feature Losses

MIT000

TensorFlow-speech-enhancement-Chinese

基于深度学习的语音增强、去混响

Language:Python000

Wave-U-Net

Implementation of the Wave-U-Net for audio source separation

MIT000

Wave-U-Net-For-Speech-Enhancement

Improved speech enhancement with the Wave-U-Net, a deep convolutional neural network architecture for audio source separation, implemented for the task of speech enhancement in the time-domain.

Language:Python000

wavelet-denoising

Speech enhancement based on adaptive wavelet denoising on multitaper spectrum

Language:MATLAB000

WebRTC_NS

Noise Suppression Module Port From WebRTC

Language:CBSD-3-Clause000