tenebo

followers

following

stars

Inpyo Lee's repositories

Hitomi-Downloader-Mac

Hitomi Downloader for macOS

g2pk2

Updated folk of g2pk

Language:PythonApache-2.0600

mellotron-korean

Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data

Language:Jupyter NotebookBSD-3-Clause100

acoustic-model

Acoustic models for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion

Language:PythonMIT000

CleanUNet

Official PyTorch Implementation of CleanUNet (ICASSP 2022)

Language:PythonMIT000

DemarestGPA

Language:JavaScriptMIT000

Grad-TTS

Language:PythonMIT000

Link-Collector

Language:CSSMIT000

radtts

Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, Diverse Synthesis, and Generative Modeling and Fine-Grained Control over of Low Dimensional (F0 and Energy) Speech Attributes.

Language:RoffMIT000

vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Language:Jupyter NotebookMIT000

easysd

Drop-and-run script for Automatic1111's Stable Diffusion WebUI

000

flowtron-korean

Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style transfer

Language:Jupyter NotebookApache-2.0000

free.webtoon.ga

Language:HTML000

goxel

GoXel - Download accelerator in Go

Language:GoApache-2.0000

HD

000

hifigan

An 16kHz implementation of HiFi-GAN for soft-vc.

MIT000

hubert

HuBERT content encoders for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion

MIT000

KITScenarist

Screenwriting software.

GPL-3.0000

Learn2Sing2.0

Diffusion and Mutual Information-Based Target Speaker SVS by Learning from Singing Teacher

000

m3u8-Downloader-Go

m3u8 downloader with golang

000

noteshrink

Convert scans of handwritten notes to beautiful, compact PDFs

Language:PythonMIT000

ProDiff

PyTorch Implementation of ProDiff (ACM-MM'22) with a Extremely-Fast diffusion speech synthesis pipeline

Language:PythonMIT000

SD-UI

Stable Diffusion web UI

Language:PythonAGPL-3.0000

soft-vc

Soft speech units for voice conversion

MIT000

Speech-Backbones

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

000

standard-demo-we-extensions

Extension index for stable-diffusion-webui

000

standarddemo

High-Resolution Image Synthesis with Latent Diffusion Models

MIT000

steal-danger-online

Language:PythonAGPL-3.0000

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonMPL-2.0000

vall-e

An unofficial PyTorch implementation of the audio LM VALL-E, WIP

MIT000