BaekMS

MinSang Baek's repositories

AcousticSwarms-Robots

Language: C

AcousticSwarms-Speech

Language: Python, License: MIT

audiomentations

A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

Language: Python, License: MIT

audiosocket

Simple bidirectional audio protocol

License: Apache-2.0

awesome-python-scientific-audio

Curated list of Python software and packages related to scientific research in audio

DeepSpeech

DeepSpeech is an open-source embedded (offline, on-device) speech-to-text engine that can run in real time on devices ranging from a Raspberry Pi 4 to high-power GPU servers.

Language: C++, License: MPL-2.0

DPTBF

Language: Python, License: GPL-3.0

FN-SSL

PyTorch implementation of "FN-SSL: Full-Band and Narrow-Band Fusion for Sound Source Localization." [INTERSPEECH 2023]

Language: Python

FQSE

Fully Quantized Neural Networks For Speech Enhancement

Language: Python, License: Apache-2.0

FullSubNet-plus

The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".

License: Apache-2.0

LPCNet

Efficient neural speech synthesis

Language: C, License: BSD-3-Clause

meeteval

Language: Python, License: MIT

ml-spatial-librispeech

A large synthetic dataset of spatial audio with multiple labels

License: NOASSERTION

MULTI-AUDIODEC

This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.

Language: Python

MultiMetricGANplusplus

Language: Python

mvdrpf

Language: Python, License: MIT

NeMo

NeMo: a toolkit for conversational AI

Language: Python, License: Apache-2.0

NeuralSpeech

Language: Python, License: MIT

nfs-binaural

License: MIT

NISQA

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

Language: Python, License: MIT

nussl

A flexible source separation library in Python

Language: Python, License: MIT

open-unmix-pytorch

Open-Unmix - Music Source Separation for PyTorch

Language: Python, License: MIT

pulse

A PyTorch implementation of "Audio signal enhancement with learning from positive and unlabelled data"

Language: Python, License: MIT

pydiogment

Python library for audio augmentation

Language: Python, License: BSD-3-Clause

SC-Wind-Noise-Generator

Generate synthetic wind noise signals based on a wind speed profile.

Language: Python, License: MIT

speechmetrics

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

Language: Python, License: MIT

SRMRpy

Python implementation of the SRMR toolbox

Language: Python, License: NOASSERTION

sydra

Language: Python, License: MIT

TDANet

An efficient speech separation method

Language: Python, License: Apache-2.0

torch-pesq

PyTorch implementation of the Perceptual Evaluation of Speech Quality for wideband audio

Language: Python, License: MIT