MinSang Baek's repositories

Audio-visual-sound-localization

Audio-visual sound localization

Language:PythonStargazers:0Issues:0Issues:0

AudioFile

A simple C++ library for reading and writing audio files.

Language:C++License:MITStargazers:0Issues:0Issues:0

awesome-mac

 Now we have become very big, Different from the original idea. Collect premium software in various categories.

Language:JavaScriptLicense:CC0-1.0Stargazers:0Issues:0Issues:0

clarity

Clarity Challenge toolkit - software for building Clarity Challenge systems

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

CMGAN

Conformer-based Metric GAN for speech enhancement

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

cocktail-fork-separation

Baseline multi-resolution cross network model trained using the Divide and Remaster Dataset

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

ControlNet

Let us control diffusion models!

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

DeepWaveTorch

DeepWave: A Recurrent Neural-Network for Real-Time Acoustic Imaging (PyTorch implementation)

Language:Jupyter NotebookLicense:CC-BY-4.0Stargazers:0Issues:0Issues:0

DisVoice

feature extraction from speech signals

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

DnnNormTimeFreq4DoA

A DNN based Normalized Time-frequency Weighted Criterion for Robust Wideband DoA Estimation

Language:PythonStargazers:0Issues:0Issues:0

ffc_se

Code for the paper "FFC-SE: Fast Fourier Convolution for Speech Enhancement" (published at Interspeech 2022 conference)

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

fundsp

Audio DSP library for audio processing and synthesis

Language:RustLicense:Apache-2.0Stargazers:0Issues:0Issues:0

gss

A simple package for Guided source separation (GSS)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

ivy

The Unified Machine Learning Framework

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Learning_Neural_Acoustic_Fields

Official code for "Learning Neural Acoustic Fields"

Language:PythonStargazers:0Issues:0Issues:0

Multi-clue-TSE-data

Data simulation scripts for paper "Target Sound Extraction with Variable Cross-modality Clues"

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

nara_wpe

Different implementations of "Weighted Prediction Error" for speech dereverberation

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

NKF-AEC

Acoustic Echo Cancellation with Nerual Kalman Filtering

Language:HTMLStargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

opensmile

The Munich Open-Source Large-Scale Multimedia Feature Extractor

Language:C++License:NOASSERTIONStargazers:0Issues:0Issues:0

padertorch

A collection of common functionality to simplify the design, training and evaluation of machine learning models based on pytorch with an emphasis on speech processing.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:CLicense:NOASSERTIONStargazers:0Issues:0Issues:0

shap

A game theoretic approach to explain the output of any machine learning model.

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

sigsep-mus-eval

museval - source separation evaluation tools for python

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

sms_wsj

SMS-WSJ: Spatialized Multi-Speaker Wall Street Journal database for multi-channel source separation and recognition

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

sound-spaces

A first-of-its-kind acoustic simulation platform for audio-visual embodied AI research. It supports training and evaluating multiple tasks and applications.

Language:PythonLicense:CC-BY-4.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

License:NOASSERTIONStargazers:0Issues:0Issues:0

visqol

Perceptual Quality Estimator for speech and audio

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0