Inpyo Lee's repositories
Hitomi-Downloader-Mac
Hitomi Downloader for macOS
mellotron-korean
Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data
acoustic-model
Acoustic models for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion
CleanUNet
Official PyTorch Implementation of CleanUNet (ICASSP 2022)
radtts
Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, Diverse Synthesis, and Generative Modeling and Fine-Grained Control over Low-Dimensional (F0 and Energy) Speech Attributes.
vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
easysd
Drop-and-run script for Automatic1111's Stable Diffusion WebUI
flowtron-korean
Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style transfer
goxel
GoXel - Download accelerator in Go
hifigan
A 16 kHz implementation of HiFi-GAN for soft-vc.
hubert
HuBERT content encoders for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion
KITScenarist
Screenwriting software.
Learn2Sing2.0
Diffusion and Mutual Information-Based Target Speaker SVS by Learning from Singing Teacher
m3u8-Downloader-Go
m3u8 downloader written in Go
noteshrink
Convert scans of handwritten notes to beautiful, compact PDFs
ProDiff
PyTorch implementation of ProDiff (ACM-MM'22) with an extremely fast diffusion speech synthesis pipeline
SD-UI
Stable Diffusion web UI
soft-vc
Soft speech units for voice conversion
Speech-Backbones
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
standard-demo-we-extensions
Extension index for stable-diffusion-webui
standarddemo
High-Resolution Image Synthesis with Latent Diffusion Models
TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
vall-e
An unofficial PyTorch implementation of the audio LM VALL-E, WIP