Jong-Jin Kim's repositories

Language:PythonLicense:AGPL-3.0Stargazers:1Issues:1Issues:0

autovc

AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

crank

A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Crossbow

Crossbow: A Multi-GPU Deep Learning System for Training with Small Batch Sizes

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

CVC

CVC: Contrastive Learning for Non-parallel Voice Conversion (submitted to ICASSP 2021, in PyTorch)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

EA-SVC

An implementation of "Phonetic Posteriorgrams based Many-to-Many Singing Voice Conversion via Adversarial Training"

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

efficient_tts

PyTorch implementation of "EfficientTTS: An Efficient and High-Quality Text-to-Speech Architecture"

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Glow_TTS

An implementation of the GlowTTS model. Several modes are added: speaker embedding, prosody encoder (GST), and gradient reversal.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

GPM

Official Code Repository for "Gradient Projection Memory for Continual Learning"

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

hifigan-denoiser

HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

KoSpeech

Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

libebur128

A library implementing the EBU R128 loudness standard.

Language:CLicense:MITStargazers:0Issues:0Issues:0

magenta

Magenta: Music and Art Generation with Machine Intelligence

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

ngrest

Fast and easy C++ RESTful WebServices framework

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

pitchtron

TTS for pitch-accented language. Korean dialect DB.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

PyTSMod

An open-source Python library for audio time-scale modification.

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

rainbow-memory

Official PyTorch implementation of Rainbow Memory (CVPR 2021)

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

SC-GlowTTS

SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model

License:MITStargazers:0Issues:0Issues:0

SC-WaveRNN

Official PyTorch implementation of Speaker Conditional WaveRNN

Language:PythonStargazers:0Issues:0Issues:0

SkipVQVC

An implementation of SkipVQVC with various settings.

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

speaker_embeddings_GE2E

PyTorch Implementation of Generalized End-to-End Loss for Speaker Verification

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

Text-to-Speech

Tacotron + WaveRNN Vocoder

License:MITStargazers:0Issues:0Issues:0

VocGAN

VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

vq-vae-2-pytorch

Implementation of Generating Diverse High-Fidelity Images with VQ-VAE-2 in PyTorch

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

VQ-VAE-Speech

PyTorch implementation of VQ-VAE + WaveNet by [Chorowski et al., 2019] and VQ-VAE on speech signals by [van den Oord et al., 2017]

Language:PythonLicense:MITStargazers:0Issues:0Issues:0