Samuel Samsudin Ng's repositories

speech_emo_recognition

Speech emotion recognition using fully-convolutional and convolutional-recurrent models

Language:PythonStargazers:4Issues:1Issues:0

cv_histogram_equalization

Python implementation of global histogram equalization

Language:Jupyter NotebookStargazers:1Issues:2Issues:0

aps

A personal toolkit for single- and multi-channel speech recognition, enhancement, and separation.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

asteroid

The PyTorch-based audio source separation toolkit for researchers

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

audacity

Audio Editor

Language:CLicense:NOASSERTIONStargazers:0Issues:0Issues:0

awesome-speech-enhancement

Speech enhancement / speech separation / sound source localization

License:GPL-2.0Stargazers:0Issues:0Issues:0

ConferencingSpeech2021

Conferencing Speech Challenge

License:Apache-2.0Stargazers:0Issues:0Issues:0

convNet.pytorch

ConvNet training using PyTorch

License:MITStargazers:0Issues:0Issues:0

DCRNN

Implementation of Diffusion Convolutional Recurrent Neural Network in TensorFlow

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

denoiser

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model that works on the raw waveform and runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized in both the time and frequency domains, using multiple loss functions. Empirical evidence shows that it can remove various kinds of background noise, including stationary and non-stationary noise, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly to the raw waveform, which further improve model performance and generalization.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

FullSubNet

PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."

License:MITStargazers:0Issues:0Issues:0

kissfft

A Fast Fourier Transform (FFT) library that tries to Keep it Simple, Stupid

Language:CLicense:NOASSERTIONStargazers:0Issues:1Issues:0

label-smoothing-pytorch

PyTorch implementation of label smoothing

License:MITStargazers:0Issues:0Issues:0

libsndfile

A C library for reading and writing sound files containing sampled audio data.

License:LGPL-2.1Stargazers:0Issues:0Issues:0

openMHA

The open Master Hearing Aid (openMHA)

License:AGPL-3.0Stargazers:0Issues:0Issues:0

portaudio

PortAudio is a cross-platform, open-source C language library for real-time audio input and output.

License:NOASSERTIONStargazers:0Issues:0Issues:0

portaudio-binaries

Pre-compiled shared libraries for PortAudio

Stargazers:0Issues:0Issues:0

pytorch-template

PyTorch deep learning projects made easy.

License:MITStargazers:0Issues:0Issues:0

sndfilter

Algorithms for sound filters, like reverb, dynamic range compression, lowpass, highpass, notch, etc.

Language:CLicense:MITStargazers:0Issues:1Issues:0

speech-accent-recognition-v2

Speech accent recognition with an image classification technique

Language:PythonStargazers:0Issues:0Issues:0

Speech-Enhancement-Measures

Speech enhancement metrics: CSIG, CBAK, CMOS, SSNR, PESQ, STOI, ESTOI, SNR, IS, LLR, WSS

Language:MATLABStargazers:0Issues:1Issues:0

speech_accent_recognition

Accent classification from speech spectral images with an image classifier

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

Spherical-Array-Processing

A collection of MATLAB routines for acoustical array processing on spherical harmonic signals, commonly captured with a spherical microphone array.

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

stackbit-theme-fresh

Fresh: a personal theme with a blog for Stackbit

Stargazers:0Issues:0Issues:0

utils.pytorch

Utilities for PyTorch

License:MITStargazers:0Issues:0Issues:0

VGGVox-PyTorch

Implementing VGGVox for the VoxCeleb1 dataset in PyTorch.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

wavencoder

WavEncoder is a Python library for encoding raw audio with a PyTorch backend.

License:MITStargazers:0Issues:0Issues:0