There are 16 repositories under the vocoder topic.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
vits2 backbone with multilingual-bert
😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for TensorFlow 2 (supports English, French, Korean, Chinese, and German, and is easy to adapt to other languages)
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with PyTorch
General Speech Restoration
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis
PyTorch Implementation of FastDiff (IJCAI'22)
VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889)
iSTFTNet: Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform
Turn your Android phone into Amateur Radio Codec2/OPUS APRS enabled DV handheld transceiver (Bluetooth/BLE/USB/TCPIP KISS/Sound modem client for DV digital voice communication)
Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"
A vocoder framework that has been widely used in the research community since 1999.
A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder
HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform
Includes Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN; NHV may be added in the future.
Official implementation of "Avocodo: Generative Adversarial Network for Artifact-Free Vocoder" (AAAI2023)
PyTorch Implementation of Multi-Singer (ACM-MM'21)
STFT based real-time pitch and timbre shifting in C++ and Python
Fatcord's Alternative WaveRNN (Faster training)
[AAAI 2024] Code for CTX-vec2wav in UniCATS
Avocodo: Generative Adversarial Network for Artifact-free Vocoder