MinSang Baek's repositories

Language:CStargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

audiomentations

A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

audiosocket

Simple bidirectional audio protocol

License:Apache-2.0Stargazers:0Issues:0Issues:0

awesome-python-scientific-audio

Curated list of python software and packages related to scientific research in audio

Stargazers:0Issues:0Issues:0

DeepSpeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

Language:C++License:MPL-2.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

FN-SSL

PyTorch implementation of "FN-SSL: Full-Band and Narrow-Band Fusion for Sound Source Localization." [INTERSPEECH 2023]

Language:PythonStargazers:0Issues:0Issues:0

FQSE

Fully Quantized Neural Networks For Speech Enhancement

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

FullSubNet-plus

The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".

License:Apache-2.0Stargazers:0Issues:0Issues:0

LPCNet

Efficient neural speech synthesis

Language:CLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

ml-spatial-librispeech

A large synthetic dataset of spatial audio with multiple labels

License:NOASSERTIONStargazers:0Issues:0Issues:0

MULTI-AUDIODEC

This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.

Language:PythonStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

NeMo

NeMo: a toolkit for conversational AI

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

NISQA

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

nussl

A flexible source separation library in Python

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

open-unmix-pytorch

Open-Unmix - Music Source Separation for PyTorch

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

pulse

A Pytorch implementation of "Audio signal enhancement with learning from positive and unlabelled data"

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

pydiogment

:mega: Python library for audio augmentation

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

SC-Wind-Noise-Generator

Generate synthetic wind noise signals based on a wind speed profile.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

speechmetrics

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

SRMRpy

Python implementation of the SRMR toolbox

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

TDANet

An efficient speech separation method

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

torch-pesq

PyTorch implementation of the Perceptual Evaluation of Speech Quality for wideband audio

Language:PythonLicense:MITStargazers:0Issues:0Issues:0