Samuel Samsudin Ng's repositories
speech_emo_recognition
Speech emotion recognition models using fully-convolutional and convolutional-recurrent models
cv_histogram_equalization
Python implementation of global histogram equalization
aps
A personal toolkit for single/multi-channel speech recognition & enhancement & separation.
asteroid
The PyTorch-based audio source separation toolkit for researchers
audacity
Audio Editor
awesome-speech-enhancement
speech enhancement\speech seperation\sound source localization
ConferencingSpeech2021
Conferencing Speech Challenge
convNet.pytorch
ConvNet training using pytorch
denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.
FullSubNet
PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
label-smoothing-pytorch
label smoothing PyTorch implementation
libsndfile
A C library for reading and writing sound files containing sampled audio data.
openMHA
The open Master Hearing Aid (openMHA)
portaudio
PortAudio is a cross-platform, open-source C language library for real-time audio input and output.
portaudio-binaries
Pre-compiled shared libraries for PortAudio
pytorch-template
PyTorch deep learning projects made easy.
speech-accent-recognition-v2
Speech accent recognition with image classification technique
Speech-Enhancement-Measures
speech enhancement metrics:CSIG, CBAK, CMOS, SSNR, PESQ, STOI, ESTOI, SNR, IS, LLR, WSS
speech_accent_recognition
Accent classification from speech spectral images with image classifier
Spherical-Array-Processing
A collection of MATLAB routines for acoustical array processing on spherical harmonic signals, commonly captured with a spherical microphone array.
stackbit-theme-fresh
Fresh a personal theme with a blog for Stackbit
utils.pytorch
Utilities for Pytorch
VGGVox-PyTorch
Implementing VGGVox for VoxCeleb1 dataset in PyTorch.
wavencoder
WavEncoder is a Python library for encoding raw audio with PyTorch backend.