Guochen Yu (yuguochencuc)

yuguochencuc

Geek Repo

Company:Kuaishou Technology

Location:Beijing

Home Page:https://yuguochencuc.github.io/

Github PK Tool:Github PK Tool

Guochen Yu's repositories

DB-AIAT

The implementation of "Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement"

Language:PythonLicense:MITStargazers:113Issues:3Issues:9

BAE-Net

BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION

SF-Net

The implementation of "Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for Real-Time Full-Band Speech Enhancement"

DBT-Net

The audio demos with respect to the paper "DBT-Net: Dual-branch federative magnitude and phase estimation with attention-in-attention transformer for monaural speech enhancement" are provided (submitted to TASLP). The code will also be released soon.

ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

Language:Jupyter NotebookLicense:MITStargazers:1Issues:0Issues:0

sgmse

Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

audiolm-pytorch

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

awesome-speech-enhancement

speech enhancement\speech seperation\sound source localization

License:GPL-2.0Stargazers:0Issues:0Issues:0

audio-generation-papers

recent audio generation papers (including speech, music and general audios)

License:Apache-2.0Stargazers:0Issues:0Issues:0

CDiffuSE

Conditional Diffusion Probabilistic Model for Speech Enhancement

License:Apache-2.0Stargazers:0Issues:0Issues:0

CLAP

Contrastive Language-Audio Pretraining

License:CC0-1.0Stargazers:0Issues:0Issues:0

DeepFilterNet2

Noise supression using deep filtering

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch

License:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

License:NOASSERTIONStargazers:0Issues:0Issues:0

FastDiff

PyTorch Implementation of FastDiff (IJCAI'22)

Stargazers:0Issues:0Issues:0

FullSubNet

PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."

License:MITStargazers:0Issues:0Issues:0

gpuRIR

Python library for Room Impulse Response (RIR) simulation with GPU acceleration

License:AGPL-3.0Stargazers:0Issues:0Issues:0

leetcode

Leetcode solutions

Stargazers:0Issues:0Issues:0

LPCNet

Efficient neural speech synthesis

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

NKF-AEC

Acoustic Echo Cancellation with Nerual Kalman Filtering

Stargazers:0Issues:0Issues:0

NLP-Tutorials

Simple implementations of NLP models. Tutorials are written in Chinese on my website https://mofanpy.com

License:MITStargazers:0Issues:0Issues:0

opus

Modern audio compression for the internet.

License:NOASSERTIONStargazers:0Issues:0Issues:0

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit.

License:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

SpeechGPT

SpeechGPT: Empowering Large Language Models with Intrinsic Cross-Modal Conversational Abilities.

Stargazers:0Issues:0Issues:0

vall-e

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Can be trained on a single GPU!

License:Apache-2.0Stargazers:0Issues:0Issues:0

wavegrad

A fast, high-quality neural vocoder.

License:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0
Language:HTMLStargazers:0Issues:0Issues:0