Yiwen Wang's starred repositories

SOFAtoolbox

SOFA Toolbox (API for Matlab, Octave)

Language: MATLAB | License: EUPL-1.2 | Stargazers: 114 | Issues: 0

RVAE-EM

Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutive transfer function" [ICASSP2024]

Language: Python | License: MIT | Stargazers: 36 | Issues: 0

DeFT-AN-RT

Official page of "DeFT-AN RT: Real-time Multichannel Speech Enhancement using Dense Frequency-Time Attentive Network and Non-overlapping Synthesis Window", in Proc. Interspeech, 2023

Stargazers: 6 | Issues: 0
Language: Python | Stargazers: 82 | Issues: 0

diffwave

DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.

Language: Python | License: Apache-2.0 | Stargazers: 742 | Issues: 0

DOSE

DOSE: Diffusion Dropout with Adaptive Prior for Speech Enhancement, Conference on Neural Information Processing Systems (NeurIPS), 2023

Language: Python | Stargazers: 38 | Issues: 0

Wave-U-Net-Pytorch

Improved Wave-U-Net implemented in PyTorch

Language: Python | License: MIT | Stargazers: 294 | Issues: 0

Neural-Speech-Dereverberation

Machine and Deep Learning models for speech dereverberation

Language: Python | License: GPL-3.0 | Stargazers: 102 | Issues: 0

SuGaR

[CVPR 2024] Official PyTorch implementation of SuGaR: Surface-Aligned Gaussian Splatting for Efficient 3D Mesh Reconstruction and High-Quality Mesh Rendering

Language: C++ | License: NOASSERTION | Stargazers: 1976 | Issues: 0

Uformer

Uformer: A Unet based dilated complex & real dual-path conformer network for simultaneous speech enhancement and dereverberation

Language: Python | Stargazers: 91 | Issues: 0

clarity

Clarity Challenge toolkit - software for building Clarity Challenge systems

Language: Python | License: MIT | Stargazers: 115 | Issues: 0

Speech-Separation-Paper-Tutorial

Must-read papers for speech separation based on neural networks

Stargazers: 724 | Issues: 0

voice_datasets

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

Stargazers: 1639 | Issues: 0

SpeechAlgorithms

Speech Algorithms

Language: C | License: Apache-2.0 | Stargazers: 729 | Issues: 0

SemanticHearing

Real-time binaural target sound extraction model.

Language: Python | License: MIT | Stargazers: 61 | Issues: 0

Qwen-Audio

The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

Language: Python | License: NOASSERTION | Stargazers: 1301 | Issues: 0

denoiser

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model that works on the raw waveform and runs in real time on a laptop CPU.

Language: Python | License: NOASSERTION | Stargazers: 3 | Issues: 0
License: MIT | Stargazers: 3 | Issues: 0

Transformers-Tutorials

This repository contains demos I made with the Transformers library by HuggingFace.

Language: Jupyter Notebook | License: MIT | Stargazers: 8730 | Issues: 0
Language: Python | License: NOASSERTION | Stargazers: 315 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 3 | Issues: 0

Multimodal-Emotion-Recognition-Challenges

Multimodal emotion recognition code implementation on MER23 and MuSe challenges

Stargazers: 7 | Issues: 0
Language: Python | License: NOASSERTION | Stargazers: 42 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 101 | Issues: 0

MESH2IR

This is the official implementation of our mesh-based neural network (MESH2IR) to generate acoustic impulse responses (IRs) for indoor 3D scenes represented using a mesh.

Language: Python | Stargazers: 72 | Issues: 0

Pengi

An Audio Language model for Audio Tasks

Language: Python | License: MIT | Stargazers: 272 | Issues: 0

McNet

The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023

Language: Python | Stargazers: 96 | Issues: 0

AudioSep

Official implementation of "Separate Anything You Describe"

Language: Python | License: MIT | Stargazers: 1523 | Issues: 0

Speech-Resources

Speech-related labs, companies, resources, internships, and more; recommendations and self-nominations are welcome

Stargazers: 460 | Issues: 0

Awesome-Speech-Pretraining

Paper, Code and Statistics for Self-Supervised Learning and Pre-Training on Speech.

Stargazers: 196 | Issues: 0