Jong-Jin Kim's repositories

Language:PythonLicense:AGPL-3.0Stargazers:1Issues:1Issues:0

bark

🔊 Text-Prompted Generative Audio Model

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

coqui-TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonLicense:MPL-2.0Stargazers:0Issues:0Issues:0

crank

A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

CVC

CVC: Contrastive Learning for Non-parallel Voice Conversion (submitted to ICASSP 2021, in PyTorch)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

efficient_tts

Pytorch implementation of "Efficienttts: an efficient and high-quality text-to-speech architecture"

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

GPM

Official Code Repository for "Gradient Projection Memory for Continual Learning"

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

hallo

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

License:MITStargazers:0Issues:0Issues:0

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

hifigan-denoiser

HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

KoSpeech

Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

libebur128

A library implementing the EBU R128 loudness standard.

Language:CLicense:MITStargazers:0Issues:0Issues:0

magenta

Magenta: Music and Art Generation with Machine Intelligence

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

MeloTTS

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.

License:MITStargazers:0Issues:0Issues:0

ngrest

Fast and easy C++ RESTful WebServices framework

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

piper

A fast, local neural text to speech system

License:MITStargazers:0Issues:0Issues:0

pitchtron

TTS for pitch-accented language. Korean dialect DB.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

rainbow-memory

Official pytorch implementation of Rainbow Memory (CVPR 2021)

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

SC-GlowTTS

SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model

License:MITStargazers:0Issues:0Issues:0

SC-WaveRNN

Official PyTorch implementation of Speaker Conditional WaveRNN

Language:PythonStargazers:0Issues:0Issues:0
Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

StyleSpeech

PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

test01010

test project

Stargazers:0Issues:0Issues:0

V-Express

V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.

Stargazers:0Issues:0Issues:0

VQ-VAE-Speech

PyTorch implementation of VQ-VAE + WaveNet by [Chorowski et al., 2019] and VQ-VAE on speech signals by [van den Oord et al., 2017]

Language:PythonLicense:MITStargazers:0Issues:0Issues:0