BaekMS

MinSang Baek's repositories

3D-SE-Diffusion

Language: Python

3D-Speaker

A repository for single- and multi-modal speaker verification, speaker recognition and speaker diarization.

Language: Python, License: Apache-2.0

AudioDec

An Open-source Streaming High-fidelity Neural Audio Codec

License: NOASSERTION

audioseal

Localized watermarking for AI-generated speech audios, with SOTA on robustness and very fast detector

Language: Python, License: MIT

BERP

The pytorch implementation of BERP: A Blind Estimator of Room acoustic and physical Parameters

License: GPL-3.0

DDDM-VC

Official Pytorch Implementation for "DDDM-VC: Decoupled Denoising Diffusion Models with Disentangled Representation and Prior Mixup for Verified Robust Voice Conversion" (AAAI 2024)

ddsp

DDSP: Differentiable Digital Signal Processing

Language: Python, License: Apache-2.0

DeepWaveDOA

ICASSP 2024: Robust DOA estimation from deep acoustic imaging

ears_dataset

Expressive Anechoic Recordings of Speech (EARS)

License: NOASSERTION

FSPEN

FSPEN2

generative-ai-for-beginners

18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

License: MIT

gtcrn

The official implementation of GTCRN, an ultra-lite speech enhancement model.

License: MIT

NOTSOFAR1-Challenge

NOTSOFAR-1 Challenge: Distant Diarization and ASR

License: MIT

peerRTF

robust RTFs by GCN

penn

Pitch Estimating Neural Networks (PENN)

License: MIT

pykaldi

A Python wrapper for Kaldi

License: Apache-2.0

pyneuralfx

License: MIT

PySDR

PySDR.org textbook source material, feel free to post issues/PRs

License: NOASSERTION

se-scaling

Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enhancement"

SepReformer

Official repository of SepReformer for speech separation

SEtrain

A training code template for DNN-based speech enhancement.

silero-vad

Python Wrapper of Silero VAD

License: MIT

speech_evaluation

A toolkit dedicated to speech evaluation.

License: Apache-2.0

SPMamba

SWiBE

tf-locoformer

Transformer with Local Modeling by Convolution for Speech Separation and Enhancement

License: Apache-2.0

torchcrepe

Pytorch implementation of the CREPE pitch tracker

License: MIT

webMUSHRA

A MUSHRA-compliant experiment software based on the Web Audio API

License: NOASSERTION

wespeaker

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

License: Apache-2.0