Yuhang's repositories
Auto-Tuning-Spectral-Clustering
This repo is for the SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"
s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
CaoYuhang.github.io
blog website
unified2021
A UNIFIED SPEECH ENHANCEMENT FRONT-END FOR ONLINE DEREVERBERATION, ACOUSTIC ECHO CANCELLATION, AND SOURCE SEPARATION
voice_activity_detection
Pytorch version of Voice Activity Detection (VAD) based on Deep Learning (https://github.com/filippogiruzzi)
Teacher-free-Knowledge-Distillation
Knowledge Distillation: CVPR2020 Oral, Revisiting Knowledge Distillation via Label Smoothing Regularization
tensorpack
A Neural Net Training Interface on TensorFlow, with focus on speed + flexibility
julius
Open-Source Large Vocabulary Continuous Speech Recognition Engine
Awesome-Speech-Enhancement
A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.
tcnse
TCN-based Speech Enhancement
DNS-Challenge
This repo contains the scripts, models and required files for the Interspeech 2020 Deep Noise Suppression (DNS) Challenge. We are open sourcing clean speech and noise files as well. Participants of this challenge will use the scripts from this repo to create data to train their noise suppressors. They will compare their method with our baseline noise suppressor and report the results.
Speech-Separation-Paper-Tutorial
A must-read paper for speech separation based on neural networks
AdaptiveFilterandActiveNoiseCancellation
Adaptive Filter and Active Noise Cancellation —— LMS, NLMS, RLS
rnnt-speech-recognition
End-to-end speech recognition using RNN Transducers in Tensorflow 2.0
Looking-to-Listen-at-the-Cocktail-Party
Executable code based on Google articles
Spherical-Array-Processing
A collection of MATLAB routines for acoustical array processing on spherical harmonic signals, commonly captured with a spherical microphone array.
tutorials
PyTorch tutorials.
PyTorch_Speaker_Verification
PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.
DOA
DOA
DALI
A library containing both highly optimized building blocks and an execution engine for data pre-processing in deep learning applications
pytorch-distributed
A quickstart and benchmark for pytorch distributed training.
coherence
dual-mic noise reduction based on coherence function
DeepComplexUNetPyTorch
Implementation of Deep Complex UNet Using PyTorch
wave-samples
The wave samples for the paper of "End-to-End Post-filter for Speech Separation with Deep Attention Fusion Features"
conv-tasnet
A PyTorch implementation of "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation"
audio-visual-speech-enhancement
Official Implementation of "Visual Speech Enhancement", Interspeech 2018.
espresso
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit