reppy4620 / istft_net

The implementation of iSTFTNet for comparison

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

iSTFTNet

My implementation of iSTFTNet(paper) for JSUT(link) powerd by lightning.

Usage

Running run.sh will automatically download the data and begin training.
So just execute the following commands to begin training.

cd scripts
./run.sh

synthesize.sh uses last.ckpt by default, so if you want to use a specific weight, change it.

cd scripts
./synthesis.sh

Requirements

pip install torch torchaudio lightning pandas

Result

Trained 1000 epochs(612000 steps) with batch_size = 16.

Pretrained model ckpt is here. https://huggingface.co/reppy4620/istft_net_jsut/blob/main/jsut_1000.ckpt

Some audio samples are in asset/sample/

loss plot
Discriminator mel
Generator mel
Feature Matching mel
Mel mel

About

The implementation of iSTFTNet for comparison


Languages

Language:Python 90.8%Language:Shell 7.7%Language:Dockerfile 1.4%