There are 0 repository under melgan topic.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)
VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
A fast Text-to-Speech (TTS) model. Work well for English, Mandarin/Chinese, Japanese, Korean, Russian and Tibetan (so far). 快速语音合成模型,适用于英语、普通话/中文、日语、韩语、俄语和藏语(当前已测试)。
Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.
Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech :fist:
MelGAN implementation with Multi-Band and Full Band supports...
Ultrafast GAN based Vocoder for Text to Speech
Voice Conversion pipeline consisting of GE2E speaker encoder, AutoVC conversion model and MelGAN vocoder.
Unofficial Pytorch Lightning Implementation of "Real-time Speech Frequency Bandwidth Extension"
MelGAN Multi GPU Implementation.
🐸💬 Coqui TTS Double Decoder Consistency samples
Catalan Text to Speech
A neural network (GAN) trained to apply metal screaming effects, turning vocals from songs, speeches or whispers into realistic screams and growls.
SE-MelGAN - Speaker Agnostic Rapid Speech Enhancement