BaekMS

followers

following

stars

MinSang Baek's repositories

Audio-visual-sound-localization

Audio-visual sound localization

Language:Python000

AudioFile

A simple C++ library for reading and writing audio files.

Language:C++MIT000

awesome-mac

 Now we have become very big, Different from the original idea. Collect premium software in various categories.

Language:JavaScriptCC0-1.0000

clarity

Clarity Challenge toolkit - software for building Clarity Challenge systems

Language:PythonMIT000

CMGAN

Conformer-based Metric GAN for speech enhancement

Language:PythonMIT000

cocktail-fork-separation

Baseline multi-resolution cross network model trained using the Divide and Remaster Dataset

Language:PythonMIT000

ControlNet

Let us control diffusion models!

Language:PythonApache-2.0000

DeepWaveTorch

DeepWave: A Recurrent Neural-Network for Real-Time Acoustic Imaging (PyTorch implementation)

Language:Jupyter NotebookCC-BY-4.0000

DisVoice

feature extraction from speech signals

Language:Jupyter NotebookMIT000

DnnNormTimeFreq4DoA

A DNN based Normalized Time-frequency Weighted Criterion for Robust Wideband DoA Estimation

Language:Python000

ffc_se

Code for the paper "FFC-SE: Fast Fourier Convolution for Speech Enhancement" (published at Interspeech 2022 conference)

Language:PythonNOASSERTION000

fundsp

Audio DSP library for audio processing and synthesis

Language:RustApache-2.0000

gss

A simple package for Guided source separation (GSS)

Language:PythonMIT000

ivy

The Unified Machine Learning Framework

Language:PythonApache-2.0000

Learning_Neural_Acoustic_Fields

Official code for "Learning Neural Acoustic Fields"

Language:Python000

Multi-clue-TSE-data

Data simulation scripts for paper "Target Sound Extraction with Variable Cross-modality Clues"

Language:PythonMIT000

nara_wpe

Different implementations of "Weighted Prediction Error" for speech dereverberation

Language:PythonMIT000

NKF-AEC

Acoustic Echo Cancellation with Nerual Kalman Filtering

Language:HTML000

Ny-EnhTT

MIT000

opensmile

The Munich Open-Source Large-Scale Multimedia Feature Extractor

Language:C++NOASSERTION000

padertorch

A collection of common functionality to simplify the design, training and evaluation of machine learning models based on pytorch with an emphasis on speech processing.

Language:PythonMIT000

SCTK

Language:CNOASSERTION000

shap

A game theoretic approach to explain the output of any machine learning model.

Language:Jupyter NotebookMIT000

sigsep-mus-eval

museval - source separation evaluation tools for python

Language:PythonMIT000

sms_wsj

SMS-WSJ: Spatialized Multi-Speaker Wall Street Journal database for multi-channel source separation and recognition

Language:PythonMIT000

sound-spaces

A first-of-its-kind acoustic simulation platform for audio-visual embodied AI research. It supports training and evaluating multiple tasks and applications.

Language:PythonCC-BY-4.0000

TAPLoss

Language:PythonMIT000

TF-FaSNet

Language:PythonMIT000

tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

NOASSERTION000

visqol

Perceptual Quality Estimator for speech and audio

Language:C++Apache-2.0000