yuguochencuc

followers

following

stars

Kuaishou Technology

Beijing

https://yuguochencuc.github.io/

Guochen Yu's repositories

DB-AIAT

The implementation of "Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement"

Language:PythonMIT113 3 9

BAE-Net

BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION

Language:Python50 8 8

SF-Net

The implementation of "Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for Real-Time Full-Band Speech Enhancement"

Language:Python50 2 1

DBT-Net

The audio demos with respect to the paper "DBT-Net: Dual-branch federative magnitude and phase estimation with attention-in-attention transformer for monaural speech enhancement" are provided (submitted to TASLP). The code will also be released soon.

Language:Python28 1 1

ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

Language:Jupyter NotebookMIT100

sgmse

Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation

Language:PythonMIT100

audiolm-pytorch

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

Language:PythonMIT000

awesome-speech-enhancement

speech enhancement\speech seperation\sound source localization

GPL-2.0000

audio-generation-papers

recent audio generation papers (including speech, music and general audios)

Apache-2.0000

CDiffuSE

Conditional Diffusion Probabilistic Model for Speech Enhancement

Apache-2.0000

CLAP

Contrastive Language-Audio Pretraining

CC0-1.0000

DeepFilterNet2

Noise supression using deep filtering

Language:PythonNOASSERTION000

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch

Apache-2.0000

echocatzh

000

encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

NOASSERTION000

FastDiff

PyTorch Implementation of FastDiff (IJCAI'22)

000

FullSubNet

PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."

MIT000

gpuRIR

Python library for Room Impulse Response (RIR) simulation with GPU acceleration

AGPL-3.0000

leetcode

Leetcode solutions

000

LPCNet

Efficient neural speech synthesis

BSD-3-Clause000

NKF-AEC

Acoustic Echo Cancellation with Nerual Kalman Filtering

000

NLP-Tutorials

Simple implementations of NLP models. Tutorials are written in Chinese on my website https://mofanpy.com

MIT000

opus

Modern audio compression for the internet.

NOASSERTION000

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit.

Apache-2.0000

SDCM

000

SpeechGPT

SpeechGPT: Empowering Large Language Models with Intrinsic Cross-Modal Conversational Abilities.

000

vall-e

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Can be trained on a single GPU!

Apache-2.0000

wavegrad

A fast, high-quality neural vocoder.

Apache-2.0000

yuguochencuc

000

yuguochencuc.github.io

Language:HTML000