Inpyo Lee's repositories

g2pk2

Updated fork of g2pk

Language: Python | License: Apache-2.0 | Stargazers: 6 | Issues: 0

mellotron-korean

Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data

Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 1 | Issues: 0

acoustic-model

Acoustic models for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

CleanUNet

Official PyTorch Implementation of CleanUNet (ICASSP 2022)

Language: Python | License: MIT | Stargazers: 0 | Issues: 0
Language: JavaScript | License: MIT | Stargazers: 0 | Issues: 0
Language: Python | License: MIT | Stargazers: 0 | Issues: 0
Language: CSS | License: MIT | Stargazers: 0 | Issues: 0

radtts

Provides training, inference, and voice conversion recipes for RADTTS and RADTTS++: flow-based TTS models with robust alignment learning, diverse synthesis, and generative modeling of and fine-grained control over low-dimensional (F0 and energy) speech attributes.

Language: Roff | License: MIT | Stargazers: 0 | Issues: 0

vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 0

easysd

Drop-and-run script for Automatic1111's Stable Diffusion WebUI

Stargazers: 0 | Issues: 0

flowtron-korean

Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style transfer

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 0 | Issues: 0
Language: HTML | Stargazers: 0 | Issues: 0

goxel

GoXel - Download accelerator in Go

Language: Go | License: Apache-2.0 | Stargazers: 0 | Issues: 0
Stargazers: 0 | Issues: 0

hifigan

A 16 kHz implementation of HiFi-GAN for soft-vc.

License: MIT | Stargazers: 0 | Issues: 0

hubert

HuBERT content encoders for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion

License: MIT | Stargazers: 0 | Issues: 0

KITScenarist

Screenwriting software.

License: GPL-3.0 | Stargazers: 0 | Issues: 0

Learn2Sing2.0

Diffusion and Mutual Information-Based Target Speaker SVS by Learning from Singing Teacher

Stargazers: 0 | Issues: 0

m3u8-Downloader-Go

m3u8 downloader written in Go

Stargazers: 0 | Issues: 0

noteshrink

Convert scans of handwritten notes to beautiful, compact PDFs

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

ProDiff

PyTorch implementation of ProDiff (ACM-MM'22) with an extremely fast diffusion speech synthesis pipeline

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

SD-UI

Stable Diffusion web UI

Language: Python | License: AGPL-3.0 | Stargazers: 0 | Issues: 0

soft-vc

Soft speech units for voice conversion

License: MIT | Stargazers: 0 | Issues: 0

Speech-Backbones

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

Stargazers: 0 | Issues: 0

standard-demo-we-extensions

Extension index for stable-diffusion-webui

Stargazers: 0 | Issues: 0

standarddemo

High-Resolution Image Synthesis with Latent Diffusion Models

License: MIT | Stargazers: 0 | Issues: 0
Language: Python | License: AGPL-3.0 | Stargazers: 0 | Issues: 0

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language: Python | License: MPL-2.0 | Stargazers: 0 | Issues: 0

vall-e

An unofficial PyTorch implementation of the audio LM VALL-E, WIP

License: MIT | Stargazers: 0 | Issues: 0