BaekMS


MinSang Baek's repositories

SonicSim

CC-BY-SA-4.0

wesep

Target Speaker Extraction Toolkit


DENSE

ICASSP 2025: Dynamic Embedding Causal Target Speech Extraction


Target-Conversation-Extraction

Code and dataset repository for the Interspeech 2024 paper "Target conversation extraction: Source separation using turn-taking dynamics"

NOASSERTION

Apollo

Music repair method to convert lossy MP3 compressed music to lossless music.


Stable-Hybrid-Auditory-Filterbanks

Official implementation of the Interspeech 2024 paper "Hold Me Tight: Stable Encoder-Decoder Design for Speech Enhancement"

BSD-3-Clause-Clear

pykaldi

A Python wrapper for Kaldi

Apache-2.0

pyneuralfx

MIT

webMUSHRA

MUSHRA-compliant experiment software based on the Web Audio API

NOASSERTION

wespeaker

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

Apache-2.0

NOTSOFAR1-Challenge

NOTSOFAR-1 Challenge: Distant Diarization and ASR

MIT

speech_evaluation

A toolkit dedicated to speech evaluation.

Apache-2.0

tf-locoformer

Transformer with Local Modeling by Convolution for Speech Separation and Enhancement

Apache-2.0

PySDR

PySDR.org textbook source material, feel free to post issues/PRs

NOASSERTION

penn

Pitch Estimating Neural Networks (PENN)

MIT

X-TF-GridNet

Implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", accepted by Information Fusion.


peerRTF

robust RTFs by GCN


ears_dataset

Expressive Anechoic Recordings of Speech (EARS)

NOASSERTION

SepReformer

Official repository of SepReformer for speech separation


torchcrepe

PyTorch implementation of the CREPE pitch tracker

MIT

AudioDec

An Open-source Streaming High-fidelity Neural Audio Codec

NOASSERTION

se-scaling

Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enhancement"


SWiBE


silero-vad

Python Wrapper of Silero VAD

MIT

SEtrain

A training code template for DNN-based speech enhancement.


BERP

PyTorch implementation of BERP: A Blind Estimator of Room acoustic and physical Parameters

GPL-3.0

FSPEN


gtcrn

The official implementation of GTCRN, an ultra-lite speech enhancement model.

MIT

FSPEN2


ddsp

DDSP: Differentiable Digital Signal Processing

Apache-2.0