wyw97

Yiwen Wang's starred repositories

SOFAtoolbox

SOFA Toolbox (API for Matlab, Octave)

Language:MATLABEUPL-1.211400

RVAE-EM

Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutive transfer function" [ICASSP2024]

Language:PythonMIT3600

DeFT-AN-RT

Official page of "DeFT-AN RT Real-time Multichannel Speech Enhancement using Dense Frequency-Time Attentive Network and Non-overlapping Synthesis Window, in Proc. Interspeech, 2023"

600

diffwave

DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.

Language:PythonApache-2.074200

DOSE

DOSE: Diffusion Dropout with Adaptive Prior for Speech Enhancement, Conference on Neural Information Processing Systems (NeurIPS), 2023

Language:Python3800

Wave-U-Net-Pytorch

Improved Wave-U-Net implemented in Pytorch

Language:PythonMIT29400

Neural-Speech-Dereverberation

Machine and Deep Learning models for speech dereverberation

Language:PythonGPL-3.010200

SuGaR

[CVPR 2024] Official PyTorch implementation of SuGaR: Surface-Aligned Gaussian Splatting for Efficient 3D Mesh Reconstruction and High-Quality Mesh Rendering

Language:C++NOASSERTION197600

Uformer

Uformer: A Unet based dilated complex & real dual-path conformer network for simultaneous speech enhancement and dereverberation

Language:Python9100

clarity

Clarity Challenge toolkit - software for building Clarity Challenge systems

Language:PythonMIT11500

Speech-Separation-Paper-Tutorial

A must-read paper for speech separation based on neural networks

72400

voice_datasets

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

163900

SpeechAlgorithms

Speech Algorithms

Language:CApache-2.072900

SemanticHearing

Real-time binaural target sound extraction model.

Language:PythonMIT6100

Qwen-Audio

The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

Language:PythonNOASSERTION130100

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU.

Language:PythonNOASSERTION300

wyw97

Yiwen Wang's starred repositories

SOFAtoolbox

RVAE-EM

DeFT-AN-RT

EEG-To-Text

diffwave

DOSE

Wave-U-Net-Pytorch

Neural-Speech-Dereverberation

SuGaR

Uformer

clarity

Speech-Separation-Paper-Tutorial

voice_datasets

SpeechAlgorithms

SemanticHearing

Qwen-Audio

denoiser

Ny-EnhTT

Transformers-Tutorials

uss

Param-GTFB-GCFB

Multimodal-Emotion-Recognition-Challenges

ml-nvas3d

spherical-cnn

MESH2IR

Pengi

McNet

AudioSep

Speech-Resources

Awesome-Speech-Pretraining