Yiwen Wang's starred repositories
Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.
voice_datasets
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
Qwen-Audio
The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.
SpeechAlgorithms
Speech Algorithms
Speech-Separation-Paper-Tutorial
Must-read papers on neural-network-based speech separation
Speech-Resources
Speech-related labs, companies, resources, internships, and more; recommendations and self-nominations are welcome
Wave-U-Net-Pytorch
Improved Wave-U-Net implemented in PyTorch
Awesome-Speech-Pretraining
Paper, Code and Statistics for Self-Supervised Learning and Pre-Training on Speech.
Neural-Speech-Dereverberation
Machine and Deep Learning models for speech dereverberation
SemanticHearing
Real-time binaural target sound extraction model.
Multimodal-Emotion-Recognition-Challenges
Multimodal emotion recognition code implementation on MER23 and MuSe challenges
DeFT-AN-RT
Official code of "DeFT-AN RT: Real-time Multichannel Speech Enhancement using Dense Frequency-Time Attentive Network and Non-overlapping Synthesis Window", in Proc. Interspeech, 2023
denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model that works on the raw waveform and runs in real time on a laptop CPU.