donghaiyw's repositories
auraloss
Collection of audio-focused loss functions in PyTorch
covarep
A Cooperative Voice Analysis Repository for Speech Technologies
denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model that works on the raw waveform and runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized in both the time and frequency domains, using multiple loss functions. Empirical evidence shows that it can remove various kinds of background noise, including stationary and non-stationary noise, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly to the raw waveform, which further improve model performance and generalization.
espnet
End-to-End Speech Processing Toolkit
FastSpeech
An implementation of FastSpeech based on PyTorch.
legacy_STRAIGHT
A vocoder framework that has been widely used in the research community since 1999.
LPCTorch
LPC utility for the PyTorch library.
magphase
MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.
melgan-neurips
GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
nnAudio
Audio processing using PyTorch 1D convolutional networks
paper-reading
Paragraph-by-paragraph close reading of classic and new deep learning papers
ParallelWaveGAN
Unofficial Parallel WaveGAN implementation with PyTorch
PyTorch-GAN
PyTorch implementations of Generative Adversarial Networks.
pytorch-handbook
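pytorch handbook is an open-source book that aims to help people who want to use PyTorch for deep learning development and research get started quickly. All of the PyTorch tutorials it contains have been tested to ensure they run successfully.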
pytorch-revgrad
A minimal PyTorch package implementing a gradient reversal layer.
Resemblyzer
A Python package to analyze and compare voices with deep learning
segan_pytorch
Speech Enhancement Generative Adversarial Network in PyTorch
speech-denoiser
A speech denoising LV2 plugin based on the RNNoise library
speech-resynthesis
An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-Supervised Representations.
speech_feature_extractor
Useful speech processing features, such as MFCC, gammatone filterbank, GFCC, spectrum (power spectrum and log-power spectrum), Amplitude Modulation Spectrum (AMS), and so on.
UniversalVocoding
A PyTorch implementation of "Robust Universal Neural Vocoding"
waveglow
A PyTorch implementation of WaveGlow: A Flow-based Generative Network for Speech Synthesis
WaveRNN-1
A WaveRNN implementation
WaveRNN-Pytorch
Fatcord's Alternative WaveRNN (Faster training)