samsudinng

followers

following

stars

Samuel Samsudin Ng's repositories

speech_emo_recognition

Speech emotion recognition models using fully-convolutional and convolutional-recurrent models

Language:Python4 10

cv_histogram_equalization

Python implementation of global histogram equalization

Language:Jupyter Notebook1 20

DeepComplexCRN

Language:HTMLApache-2.01 10

aps

A personal toolkit for single/multi-channel speech recognition & enhancement & separation.

Language:PythonApache-2.0000

asteroid

The PyTorch-based audio source separation toolkit for researchers

Language:PythonMIT000

audacity

Audio Editor

Language:CNOASSERTION000

awesome-speech-enhancement

speech enhancement\speech seperation\sound source localization

GPL-2.0000

ConferencingSpeech2021

Conferencing Speech Challenge

Apache-2.0000

convNet.pytorch

ConvNet training using pytorch

MIT000

cv_stereo_maps

000

DCRNN

Implementation of Diffusion Convolutional Recurrent Neural Network in Tensorflow

Language:PythonMIT010

denoiser

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.

Language:PythonNOASSERTION010

FullSubNet

PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."

MIT000

kissfft

a Fast Fourier Transform (FFT) library that tries to Keep it Simple, Stupid

Language:CNOASSERTION010

label-smoothing-pytorch

label smoothing PyTorch implementation

MIT000

libsndfile

A C library for reading and writing sound files containing sampled audio data.

LGPL-2.1000

myweb

020

openMHA

The open Master Hearing Aid (openMHA)

AGPL-3.0000

portaudio

PortAudio is a cross-platform, open-source C language library for real-time audio input and output.

NOASSERTION000

portaudio-binaries

Pre-compiled shared libraries for PortAudio

000

pytorch-template

PyTorch deep learning projects made easy.

MIT000

sndfilter

Algorithms for sound filters, like reverb, dynamic range compression, lowpass, highpass, notch, etc

Language:CMIT010

speech-accent-recognition-v2

Speech accent recognition with image classification technique

Language:Python000

Speech-Enhancement-Measures

speech enhancement metrics：CSIG, CBAK, CMOS, SSNR, PESQ, STOI, ESTOI, SNR, IS, LLR, WSS

Language:MATLAB010

speech_accent_recognition

Accent classification from speech spectral images with image classifier

Language:Jupyter Notebook000

Spherical-Array-Processing

A collection of MATLAB routines for acoustical array processing on spherical harmonic signals, commonly captured with a spherical microphone array.

BSD-3-Clause000

stackbit-theme-fresh

Fresh a personal theme with a blog for Stackbit

000

utils.pytorch

Utilities for Pytorch

MIT000

VGGVox-PyTorch

Implementing VGGVox for VoxCeleb1 dataset in PyTorch.

Language:PythonMIT010

wavencoder

WavEncoder is a Python library for encoding raw audio with PyTorch backend.

MIT000