Yuhang's repositories

Auto-Tuning-Spectral-Clustering

This repo is for the SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"

License:MITStargazers:0Issues:0Issues:0
License:NOASSERTIONStargazers:0Issues:0Issues:0

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit.

License:MITStargazers:0Issues:0Issues:0

CaoYuhang.github.io

blog website

Language:JavaScriptStargazers:0Issues:0Issues:0

unified2021

A UNIFIED SPEECH ENHANCEMENT FRONT-END FOR ONLINE DEREVERBERATION, ACOUSTIC ECHO CANCELLATION, AND SOURCE SEPARATION

Stargazers:0Issues:0Issues:0

voice_activity_detection

Pytorch version of Voice Activity Detection (VAD) based on Deep Learning (https://github.com/filippogiruzzi)

License:MITStargazers:0Issues:0Issues:0

Teacher-free-Knowledge-Distillation

Knowledge Distillation: CVPR2020 Oral, Revisiting Knowledge Distillation via Label Smoothing Regularization

License:MITStargazers:0Issues:0Issues:0

tensorpack

A Neural Net Training Interface on TensorFlow, with focus on speed + flexibility

License:Apache-2.0Stargazers:0Issues:0Issues:0

julius

Open-Source Large Vocabulary Continuous Speech Recognition Engine

License:BSD-3-ClauseStargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

Awesome-Speech-Enhancement

A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.

License:MITStargazers:0Issues:0Issues:0

tcnse

TCN-based Speech Enhancement

Stargazers:0Issues:0Issues:0

DNS-Challenge

This repo contains the scripts, models and required files for the Interspeech 2020 Deep Noise Suppression (DNS) Challenge. We are open sourcing clean speech and noise files as well. Participants of this challenge will use the scripts from this repo to create data to train their noise suppressors. They will compare their method with our baseline noise suppressor and report the results.

License:CC-BY-4.0Stargazers:0Issues:0Issues:0

Speech-Separation-Paper-Tutorial

A must-read paper for speech separation based on neural networks

Stargazers:0Issues:0Issues:0

AdaptiveFilterandActiveNoiseCancellation

Adaptive Filter and Active Noise Cancellation —— LMS, NLMS, RLS

Stargazers:0Issues:0Issues:0

rnnt-speech-recognition

End-to-end speech recognition using RNN Transducers in Tensorflow 2.0

License:MITStargazers:0Issues:0Issues:0

Looking-to-Listen-at-the-Cocktail-Party

Executable code based on Google articles

License:MITStargazers:0Issues:0Issues:0

Spherical-Array-Processing

A collection of MATLAB routines for acoustical array processing on spherical harmonic signals, commonly captured with a spherical microphone array.

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

tutorials

PyTorch tutorials.

License:BSD-3-ClauseStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

PyTorch_Speaker_Verification

PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

DOA

DOA

Stargazers:0Issues:0Issues:0

DALI

A library containing both highly optimized building blocks and an execution engine for data pre-processing in deep learning applications

License:Apache-2.0Stargazers:0Issues:0Issues:0

pytorch-distributed

A quickstart and benchmark for pytorch distributed training.

License:MITStargazers:0Issues:0Issues:0

coherence

dual-mic noise reduction based on coherence function

Stargazers:0Issues:0Issues:0

DeepComplexUNetPyTorch

Implementation of Deep Complex UNet Using PyTorch

Stargazers:0Issues:0Issues:0

wave-samples

The wave samples for the paper of "End-to-End Post-filter for Speech Separation with Deep Attention Fusion Features"

Stargazers:0Issues:0Issues:0

conv-tasnet

A PyTorch implementation of "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation"

License:MITStargazers:0Issues:0Issues:0

audio-visual-speech-enhancement

Official Implementation of "Visual Speech Enhancement", Interspeech 2018.

Stargazers:0Issues:0Issues:0

espresso

Espresso: A Fast End-to-End Neural Speech Recognition Toolkit

License:NOASSERTIONStargazers:0Issues:0Issues:0