There are 2 repositories under hifi-gan topic.
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech
A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS
PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs
A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate E2E-TTS
Avocodo: Generative Adversarial Network for Artifact-free Vocoder
TTS models for Arabic (Tacotron2, FastPitch)
Baseline Recipe for VoicePrivacy Challenge 2022: anonymization systems and evaluation software
PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.
Use FastSpeech2 and HiFi-GAN to easily perform end-to-end Korean speech synthesis.
Audio samples from "HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis"
In this repo, I developed a step-by-step pipeline for a standard MultiSpeaker Text-to-Speech system :smile: In general, I used Portaspeech as an acoustic model and iSTFTNet as vocoder...
TTS (FastPitch) for German
Python package for NSF and NSF-HiFi-GAN (unofficial)
Catalan Text to Speech
포스코 청년 AI·Big Data 아카데미 - AI 프로젝트
Aligning latent space of speaking style with human perception using a re-embedding strategy
If you have a wav & transcript, can train HiFi-GAN right now.