Guochen Yu's repositories
ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with PyTorch
audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in PyTorch
awesome-speech-enhancement
Speech enhancement, speech separation, and sound source localization
audio-generation-papers
Recent audio generation papers (including speech, music, and general audio)
CDiffuSE
Conditional Diffusion Probabilistic Model for Speech Enhancement
CLAP
Contrastive Language-Audio Pretraining
DeepFilterNet2
Noise suppression using deep filtering
diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
encodec
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
FastDiff
PyTorch Implementation of FastDiff (IJCAI'22)
FullSubNet
PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
gpuRIR
Python library for Room Impulse Response (RIR) simulation with GPU acceleration
leetcode
Leetcode solutions
LPCNet
Efficient neural speech synthesis
NKF-AEC
Acoustic Echo Cancellation with Neural Kalman Filtering
NLP-Tutorials
Simple implementations of NLP models. Tutorials are written in Chinese on my website https://mofanpy.com
opus
Modern audio compression for the internet.
s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
SpeechGPT
SpeechGPT: Empowering Large Language Models with Intrinsic Cross-Modal Conversational Abilities.
vall-e
PyTorch implementation of VALL-E (zero-shot text-to-speech); can be trained on a single GPU.
wavegrad
A fast, high-quality neural vocoder.