Minsu Kang's repositories
Pytorch-VAE-tutorial
A simple tutorial of Variational AutoEncoders with Pytorch
Pytorch-Diffusion-Model-Tutorial
A simple tutorial of Diffusion Probabilistic Models
VQVC-Pytorch
An unofficial implementation of Vector Quantization Voice Conversion (VQVC).
Awesome-DL-based-Text-to-speech-Papers-and-Resources
Various Text-to-speech (TTS) papers based on Deep-learning
Prosody-augmentation-for-Text-to-speech
Simple tool for speech dataset augmentation for modeling various prosodies.
Korean-phoneme-dictionary-generator
Korean phoneme dictionary generator for training Montreal Forced Aligner (MFA)
Pytorch-GAN-Tutorial
Various GANs implementations using pytorch
jackson-kang.github.io
a github homepage of Jackson
Speech-dataset-generator
Simple implementation of speech dataset generator for deep-learning based ASR and TTS
SpeechDatasetSplitter
A simple waveform segmentator using OpenAI's Whisper
DeepConvolutionalTTS-pytorch
Deep Convolutional TTS pytorch implementation
Pytorch-implementation-of-MobileNet-v1
Simple pytorch implementation of MobileNet v1 (A. G. Howard et. al., 2017)
SuperSeg-pytorch
An implementation of SuperSeg, a deep-learning based boundary detection model.
18-2_Machine-Learning
Repository for 18-2 Machine learning class, Handong Global University
Korean-Text-Image-Generator
Korean text-image data generator (한국어 글자 이미지 데이터 생성기)
SEAL_Renewal
리뉴얼된 SEAL
VectorQuantizedCPC
Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion
Algorithm-Practice
알고리즘 연습
CRNN_Tensorflow
Convolutional Recurrent Neural Networks(CRNN) for Scene Text Recognition
multi-speaker-tacotron-tensorflow
Multi-speaker Tacotron in TensorFlow.
Recitations
for recitation preparation
Tacotron-pytorch
Tacotron implementation with pytorch 1.0