vocoder

There are 13 repositories under vocoder topic.

coqui-ai / TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
deep-learning glow-tts hifigan melgan multi-speaker-tts python pytorch speaker-encoder speaker-encodings speech speech-synthesis tacotron text-to-speech tts tts-model vocoder voice-cloning voice-conversion voice-synthesis
Language:Python 29828
PaddlePaddle / PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
transformer conformer speech-translation streaming-asr speech-alignment punctuation-restoration streaming-tts speech-synthesis tts asr kws speech-recognition sound-classification voice-cloning vocoder voice-recognition self-supervised-learning wav2vec2 whisper code-switch
Language:Python 10246
mozilla / TTS
:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
deep-learning text-to-speech python pytorch tacotron tts speaker-encoder dataset-analysis tacotron2 tensorflow2 vocoder melgan gantts multiband-melgan glow-tts speech
Language:Jupyter Notebook 8868
TensorSpeech / TensorFlowTTS
:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)
speech-synthesis text-to-speech tensorflow2 melgan fastspeech real-time tts vocoder multi-speaker-tts fastspeech2 multiband-melgan tacotron2 parallel-wavegan tflite mobile-tts zh-tts chinese-tts korea-tts german-tts japanese-tts
Language:Python 3715
jik876 / hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
speech-synthesis gan text-to-speech tts deep-learning hifi-gan pytorch vocoder
Language:Python 1773
kan-bayashi / ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
speech-synthesis neural-vocoder text-to-speech pytorch wavenet parallel-wavenet realtime tts melgan vocoder hifigan style-melgan
Language:Jupyter Notebook 1486
mmorise / World
A high-quality speech analysis, manipulation and synthesis system
speech-analysis speech-synthesis vocoder
Language:C++ 1129
haoheliu / voicefixer
General Speech Restoration
speech-processing speech-synthesis speech-enhancement speech-analysis speech tts declipping dereverberation denoise super-resolution vocoder mel
Language:Python 920
lmnt-com / diffwave
DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
machine-learning text-to-speech neural-network paper pytorch speech-synthesis diffwave vocoder speech pretrained-models tts deep-learning
Language:Python 727
gemelo-ai / vocos
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis
vocoder vocos
Language:Python 566
ivanvovk / WaveGrad
Implementation of WaveGrad high-fidelity vocoder from Google Brain in PyTorch.
wavegrad vocoder text-to-speech tts tts-engines speech speech-synthesis ljspeech probabilistic-models diffusion-models
Language:Jupyter Notebook 397
Rongjiehuang / FastDiff
PyTorch Implementation of FastDiff (IJCAI'22)
ijcai2022 neural-vocoder speech-synthesis text-to-speech vocoder
Language:Python 390
rishikksh20 / VocGAN
VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
vocoder gan melgan vocgan speech-synthesis text-to-speech speech-processing
Language:Python 318
szechyjs / mbelib
P25 Phase 1 and ProVoice vocoder
c vocoder
Language:C++ 271
lmnt-com / wavegrad
A fast, high-quality neural vocoder.
machine-learning neural-network speech-synthesis text-to-speech wavegrad paper pytorch vocoder speech pretrained-models tts deep-learning
Language:Python 266
maum-ai / univnet
Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889)
text-to-speech vocoder gan deep-learning pytorch tts speech-synthesis
Language:Python 257
rishikksh20 / iSTFTNet-pytorch
iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform
vocoder tts speech-synthesis
Language:Python 208
sh123 / codec2_talkie
Turn your Android phone into Amateur Radio Codec2/OPUS APRS enabled DV handheld transceiver (Bluetooth/BLE/USB/TCPIP KISS/Sound modem client for DV digital voice communication)
codec2 kiss freedv amateur-radio amateurradio ham-radio digital-voice vocoder bluetooth vhf uhf hf radio lora digital fm dv walkie-talkie aprs opus
Language:Java 200
NTT123 / vietTTS
Vietnamese Text to Speech library
deep-learning hifi-gan tacotron text-to-speech tts-engines vietnam vietnamese vocoder
Language:Python 184
maum-ai / phaseaug
ICASSP 2023 Accepted
gan speech-synthesis vocoder
Language:Python 183
descriptinc / cargan
Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"
audio gan autoregression vocoder
Language:Python 180
HidekiKawahara / legacy_STRAIGHT
A vocoder framework which had been widely used in research community since 1999.
speech-analysis speech-synthesis vocoder
Language:MATLAB 171
k2kobayashi / crank
A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder
speech-synthesis voice-conversion vqvae adversarial-learning cyclic-constraints vocoder
Language:Python 167
hhguo / MSMC-TTS
Official Implement of Multi-Stage Multi-Codebook (MSMC) TTS
deep-learning speech-synthesis tts vocoder gan text-to-speech vq-vae vqgan
Language:Python 156
xcmyz / FastVocoder
Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.
hifigan melgan vocoder speech-synthesis
Language:Python 154
ncsoft / avocodo
Official implementation of "Avocodo: Generative Adversarial Network for Artifact-Free Vocoder" (AAAI2023)
avocodo gan pytorch vocoder
Language:Python 148
Rongjiehuang / Multi-Singer
PyTorch Implementation of Multi-Singer (ACM-MM'21)
text-to-speech vocoder singing-voice-synthesis speech-synthesis tts
Language:Python 138
geneing / WaveRNN-Pytorch
Fatcord's Alternative WaveRNN (Faster training)
tts-engines vocoder tacotron-2
Language:Python 134
rishikksh20 / Avocodo-pytorch
Avocodo: Generative Adversarial Network for Artifact-free Vocoder
hifi-gan speech-synthesis text-to-speech tts vocoder pytorch avocodo gan generative-adversarial-network
Language:Python 114
yl4579 / HiFTNet
HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform
deep-learning speech-synthesis text-to-speech tts vocoder vocoders
Language:Python 108
jurihock / stftPitchShift
STFT based real-time pitch and timbre shifting in C++ and Python
audio dsp fft stft vocoder pitch-shifting audio-processing cpp algorithms smbpitchshift python formants audio-effect stftpitchshift voice timbre pitch dafx plugin realtime
Language:C 103
rishikksh20 / Fre-GAN-pytorch
Fre-GAN: Adversarial Frequency-consistent Audio Synthesis
vocoder tts text-to-speech speech-synthesis speech
Language:Python 99
X-LANCE / UniCATS-CTX-vec2wav
[AAAI 2024] Code for CTX-vec2wav in UniCATS
speech-synthesis vocoder unicats vocoding semantic-token self-supervised-speech
Language:Python 98
magnetophon / VoiceOfFaust
Turn your voice into a synthesizer!
pure-data faust dsp voice synth vocoder
Language:Faust 95
syang1993 / FFTNet
A PyTorch implementation of the FFTNet: a Real-Time Speaker-Dependent Neural Vocoder
fftnet vocoder
Language:Python 92
philsyn / DiffWave-Vocoder
Pytorch Reimplementation of DiffWave Vocoder: a high quality, fast, and small neural vocoder.
diffwave diffwave-vocoder pytorch speech speech-synthesis text-to-speech tts vocoder
Language:Python 84

vocoder

coqui-ai / TTS

PaddlePaddle / PaddleSpeech

mozilla / TTS

TensorSpeech / TensorFlowTTS

jik876 / hifi-gan

kan-bayashi / ParallelWaveGAN

mmorise / World

haoheliu / voicefixer

lmnt-com / diffwave

gemelo-ai / vocos

ivanvovk / WaveGrad

Rongjiehuang / FastDiff

rishikksh20 / VocGAN

szechyjs / mbelib

lmnt-com / wavegrad

maum-ai / univnet

rishikksh20 / iSTFTNet-pytorch

sh123 / codec2_talkie

NTT123 / vietTTS

maum-ai / phaseaug

descriptinc / cargan

HidekiKawahara / legacy_STRAIGHT

k2kobayashi / crank

hhguo / MSMC-TTS

xcmyz / FastVocoder

ncsoft / avocodo

Rongjiehuang / Multi-Singer

geneing / WaveRNN-Pytorch

rishikksh20 / Avocodo-pytorch

yl4579 / HiFTNet

jurihock / stftPitchShift

rishikksh20 / Fre-GAN-pytorch

X-LANCE / UniCATS-CTX-vec2wav

magnetophon / VoiceOfFaust

syang1993 / FFTNet

philsyn / DiffWave-Vocoder