donghaiyw's repositories

auraloss

Collection of audio-focused loss functions in PyTorch

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

covarep

A Cooperative Voice Analysis Repository for Speech Technologies

Language:MATLABLicense:NOASSERTIONStargazers:0Issues:0Issues:0

denoiser

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

espnet

End-to-End Speech Processing Toolkit

Language:ShellLicense:Apache-2.0Stargazers:0Issues:0Issues:0

FastSpeech

The Implementation of FastSpeech based on pytorch.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

legacy_STRAIGHT

A vocoder framework which had been widely used in research community since 1999.

Language:MATLABLicense:Apache-2.0Stargazers:0Issues:1Issues:0

LPCTorch

LPC Utility for Pytorch Library.

Stargazers:0Issues:0Issues:0

magphase

MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

melgan-neurips

GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis

License:MITStargazers:0Issues:0Issues:0

nnAudio

Audio processing by using pytorch 1D convolution network

License:MITStargazers:0Issues:0Issues:0

paper-reading

深度学习经典、新论文逐段精读

License:Apache-2.0Stargazers:0Issues:0Issues:0

ParallelWaveGAN

Unofficial Parallel WaveGAN implementation with Pytorch

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

PyTorch-GAN

PyTorch implementations of Generative Adversarial Networks.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

pytorch-handbook

pytorch handbook是一本开源的书籍,目标是帮助那些希望和使用PyTorch进行深度学习开发和研究的朋友快速入门,其中包含的Pytorch教程全部通过测试保证可以成功运行

Stargazers:0Issues:0Issues:0

pytorch-revgrad

A minimal pytorch package implementing a gradient reversal layer.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

Resemblyzer

A python package to analyze and compare voices with deep learning

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

segan_pytorch

Speech Enhancement Generative Adversarial Network in PyTorch

Stargazers:0Issues:0Issues:0

sonnet

TensorFlow-based neural network library

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

speech-denoiser

A speech denoise lv2 plugin based on RNNoise library

License:LGPL-3.0Stargazers:0Issues:0Issues:0

speech-resynthesis

An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-Supervised Representations.

License:NOASSERTIONStargazers:0Issues:0Issues:0

speech_feature_extractor

Some useful features of speech process, such as MFCC, gammatone filterbank, GFCC, spectrum(power spectrum and log-power spectrum), Amplitude Modulation Spectrum(AMS) and so on.

Language:PythonStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

TTS

:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

Language:PythonLicense:MPL-2.0Stargazers:0Issues:1Issues:0

UniversalVocoding

A PyTorch implementation of "Robust Universal Neural Vocoding"

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

waveglow

A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

wavegrad

A fast, high-quality neural vocoder.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

WaveRNN

Pytorch implementation of Deepmind's WaveRNN model

Language:Jupyter NotebookStargazers:0Issues:1Issues:0

WaveRNN-1

A WaveRNN implementation

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

WaveRNN-Pytorch

Fatcord's Alternative WaveRNN (Faster training)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0