Jong-Jin Kim's repositories
bark
🔊 Text-Prompted Generative Audio Model
coqui-TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
crank
A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder
efficient_tts
Pytorch implementation of "Efficienttts: an efficient and high-quality text-to-speech architecture"
EmoSphere-TTS
The official implementation of EmoSphere-TTS
F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
GPM
Official Code Repository for "Gradient Projection Memory for Continual Learning"
hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
hifigan-denoiser
HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks
KoSpeech
Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition.
libebur128
A library implementing the EBU R128 loudness standard.
magenta
Magenta: Music and Art Generation with Machine Intelligence
MeloTTS
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
ngrest
Fast and easy C++ RESTful WebServices framework
piper
A fast, local neural text to speech system
pitchtron
TTS for pitch-accented language. Korean dialect DB.
rainbow-memory
Official pytorch implementation of Rainbow Memory (CVPR 2021)
SC-GlowTTS
SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model
SC-WaveRNN
Official PyTorch implementation of Speaker Conditional WaveRNN
StyleSpeech
PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation
V-Express
V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.
VQ-VAE-Speech
PyTorch implementation of VQ-VAE + WaveNet by [Chorowski et al., 2019] and VQ-VAE on speech signals by [van den Oord et al., 2017]