yhzdsk's starred repositories
Transfer-Learning-Library
Transfer Learning Library for Domain Adaptation, Task Adaptation, and Domain Generalization
cs-self-learning
计算机自学指南
AudioClassification-Pytorch
The Pytorch implementation of sound classification supports EcapaTdnn, PANNS, TDNN, Res2Net, ResNetSE and other models, as well as a variety of preprocessing methods.
resemble-enhance
AI powered speech denoising and enhancement
Wave-U-Net
Implementation of the Wave-U-Net for audio source separation
Conv-TasNet
A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).
Spider-Monkey-Whinny-Detection
Code to replicate the experiments of the Interspeech 2021 paper "Multi-Attentive Detection of the Spider Monkey Whinny in the (Actual) Wild".
speechbrain
A PyTorch-based Speech Toolkit
voxceleb_trainer
In defence of metric learning for speaker recognition
ECAPA-TDNN
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)
DeepFilterNet
Noise supression using deep filtering
crnn-audio-classification
UrbanSound classification using Convolutional Recurrent Networks in PyTorch
Noise2Noise-audio_denoising_without_clean_training_data
Source code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach". Paper accepted at the INTERSPEECH 2021 conference. This paper tackles the problem of the heavy dependence of clean speech data required by deep learning based audio denoising methods by showing that it is possible to train deep speech denoising networks using only noisy speech samples.
noisereduce
Noise reduction in python using spectral gating (speech, bioacoustics, audio, time-domain signals)
General-Purpose-Sound-Recognition-Demo
General purpose sound recognition demo
ESA-official
Robust Lane Detection via Expanded Self Attention (WACV 2022)