MinSang Baek's repositories
wesep
Target Speaker Extraction Toolkit
DENSE
ICASSP2025Dynamic Embedding Causal Target Speech Extraction
Target-Conversation-Extraction
This is the code and dataset repo for Interspeech 2024 paper "Target conversation extraction: Source separation using turn-taking dynamics"
Apollo
Music repair method to convert lossy MP3 compressed music to lossless music.
Stable-Hybrid-Auditory-Filterbanks
Official Implementation of Interspeech 2024 Paper "Hold Me Tight: Stable Encoder-Decoder Design for Speech Enhancement"
pykaldi
A Python wrapper for Kaldi
webMUSHRA
a MUSHRA compliant web audio API based experiment software
wespeaker
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
NOTSOFAR1-Challenge
NOTSOFAR-1 Challenge: Distant Diarization and ASR
speech_evaluation
A toolkit dedicate for speech evaluation.
tf-locoformer
Transformer with Local Modeling by Convolution for Speech Separation and Enhancement
PySDR
PySDR.org textbook source material, feel free to post issues/PRs
penn
Pitch Estimating Neural Networks (PENN)
X-TF-GridNet
The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", which is accepted by Information Fusion.
peerRTF
robust RTFs by GCN
ears_dataset
Expressive Anechoic Recordings of Speech (EARS)
SepReformer
Official repository of SepReformer for speech separation
torchcrepe
Pytorch implementation of the CREPE pitch tracker
AudioDec
An Open-source Streaming High-fidelity Neural Audio Codec
se-scaling
Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enhancement"
silero-vad
Python Wrapper of Silero VAD
SEtrain
A training code template for DNN-based speech enhancement.
BERP
The pytorch implementation of BERP: A Blind Estimator of Room acoustic and physical Parameters
gtcrn
The official implementation of GTCRN, an ultra-lite speech enhancement model.
ddsp
DDSP: Differentiable Digital Signal Processing