Jackson-Kang

Minsu Kang's repositories

Pytorch-VAE-tutorial

A simple tutorial on Variational AutoEncoders with PyTorch

Language: Jupyter Notebook | 367 3 4

Pytorch-Diffusion-Model-Tutorial

A simple tutorial on Diffusion Probabilistic Models

Language: Jupyter Notebook | License: MIT | 87 2 2

MFARunner

A simple tool that makes Montreal Forced Aligner easy to use. Also provides alignments (TextGrid) retrieved from ESD.

Language: Jupyter Notebook | License: MIT | 44 4 1

VQVC-Pytorch

An unofficial implementation of Vector Quantization Voice Conversion (VQVC).

Language: Python | License: MIT | 29 2 2

Awesome-DL-based-Text-to-speech-Papers-and-Resources

Various deep-learning-based text-to-speech (TTS) papers

14 20

Prosody-augmentation-for-Text-to-speech

A simple speech dataset augmentation tool for modeling various prosodies.

Language: Python | License: MIT | 14 30

Korean-phoneme-dictionary-generator

Korean phoneme dictionary generator for training Montreal Forced Aligner (MFA)

Language: Python | License: MIT | 13 2 1

Pytorch-GAN-Tutorial

Various GAN implementations using PyTorch

Language: Jupyter Notebook | 9 20

jackson-kang.github.io

Jackson's GitHub homepage

Language: HTML | 4 20

Pytorch-Conditional-Flow-Matching-Tutorial

A PyTorch tutorial on Conditional Flow Matching [Lipman22] using the MNIST dataset.

Language: Jupyter Notebook | License: MIT | 4 10

Speech-dataset-generator

A simple implementation of a speech dataset generator for deep-learning-based ASR and TTS

Language: Python | License: MIT | 4 10

SpeechDatasetSplitter

A simple waveform segmenter using OpenAI's Whisper

Language: Python | License: MIT | 4 20

Jackson-Kang

3 2 1

SuperSeg-pytorch

An implementation of SuperSeg, a deep-learning-based boundary detection model.

License: MIT | 3 30

DeepConvolutionalTTS-pytorch

A PyTorch implementation of Deep Convolutional TTS

Language: Python | 2 20

images

images for paper review

2 20

Pytorch-implementation-of-MobileNet-v1

A simple PyTorch implementation of MobileNet v1 (A. G. Howard et al., 2017)

Language: Python | 2 20

18-2_Machine-Learning

Repository for 18-2 Machine learning class, Handong Global University

Language: Python | 1 10

Korean-Text-Image-Generator

Korean text-image data generator (generates image data of Korean characters)

Language: Python | 1 10

SEAL_Renewal

The renewed SEAL

Language: JavaScript | 1 30

VectorQuantizedCPC

Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion

Language: Python | License: MIT | 1 10

Algorithm-Practice

Algorithm practice

Language: C | 020

CRNN_Tensorflow

Convolutional Recurrent Neural Networks (CRNN) for Scene Text Recognition

Language: Python | 020

HangulDB

Language: C++ | 020

multi-speaker-tacotron-tensorflow

Multi-speaker Tacotron in TensorFlow.

Language: Python | License: NOASSERTION | 000

Recitations

for recitation preparation

Language: C | 020

speech.ko

Korean read speech corpus (about 120 hours, 17GB) from the National Institute of Korean Language

Language: Shell | 010

Tacotron-pytorch

Tacotron implementation with PyTorch 1.0

Language: Python | 010