Sanghwa Ham's starred repositories
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Retrieval-based-Voice-Conversion-WebUI
Easily train a good VC model with voice data <= 10 mins!
mmsegmentation
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
auto-sklearn
Automated Machine Learning with scikit-learn
PyTorch-VAE
A Collection of Variational Autoencoders (VAE) in PyTorch.
onnx-tensorrt
ONNX-TensorRT: TensorRT backend for ONNX
open-unmix-pytorch
Open-Unmix - Music Source Separation for PyTorch
how-do-vits-work
(ICLR 2022 Spotlight) Official PyTorch implementation of "How Do Vision Transformers Work?"
Neural-Voice-Cloning-With-Few-Samples
This repository has implementation for "Neural Voice Cloning With Few Samples"
DnCNN-PyTorch
PyTorch implementation of the TIP2017 paper "Beyond a Gaussian Denoiser: Residual Learning of Deep CNN for Image Denoising"
DL_Compiler
Study Group of Deep Learning Compiler
w2v2-speaker
Research code for the paper "Fine-tuning wav2vec2 for speaker recognition" found at https://arxiv.org/abs/2109.15053
SpeechSynthesis
음성합성 관련 자료 모음
MelSpecVAE
Variational Autoencoder in the mel-spectrogram domain for one-shot audio synthesis
chafon-rfid
Read RFID data from Chafon UHF readers
video_autoencoder
Video lstm auto encoder built with pytorch. https://arxiv.org/pdf/1502.04681.pdf
mimic-my-voice
[WIP] Create a Text to Speech Engine using Your Own Voice with Mycroft's Mimic Recording Studio & Coqui Text to Speech.