MinHyung-Jo's repositories
PolyLangVITS
Multi-speaker Speech Synthesis Using VITS(KO, JA, EN, ZH)
AudioSR-Upsampling
AudioSR-Upsampling (any -> 48kHz)
One-Click-VITS-Training
VITS(Data Preprocessing + Whisper ASR + Text Preprocessing + Modification config.json + Training, Inference)
Midi-to-Singing-Voice-Conversion
Vocal Synthesis Through MIDI and Vocal Transformation Using RVC (KO, EN, JA, ZH)
One-Click-MB-iSTFT-VITS2
MB-iSTFT-VITS2(Data Preprocessing + Whisper + Text Preprocessing + Making config.json + Training, Inference) ONE-CLICK
MB-iSTFT-VITS-Korean
Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform with Korean Cleaners
Korean-Diff-Font
Korean-Diff-Font: Diffusion Model for Robust One-Shot Korean Font Generation(Pretrained Model Included)
one-click-vits-config
VITS(Datasets Preparation + Whisper ASR + Text Preprocessing + config.json)
BERT-MB-iSTFT-VITS
High-quality Multilingual(Korean, Japanese, Chinese, English, French and Spanish) TTS Model based on VITS
Efficient-Speech
Lightweight Korean TTS Model based on FastSpeech2
Best-Hospital-Location
A Study on the Status of Social Service and Hospital Location Optimization to Respond to the Aging in Jeollanam-do
IMAS-Downloader
IM@S-Downloader
versatile_audio_super_resolution
Versatile audio super resolution (any -> 48kHz) with AudioSR.