iSTFTNet

My implementation of iSTFTNet (paper) for JSUT (link), powered by Lightning.

Usage

Running run.sh automatically downloads the data and starts training, so the following commands are all you need:

cd scripts
./run.sh

The synthesis script uses last.ckpt by default. To synthesize with a different checkpoint, edit the script to point to it.

cd scripts
./synthesis.sh

pip install torch torchaudio lightning pandas

Trained for 1000 epochs (612,000 steps) with batch_size = 16.
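As a rough sanity check on these figures, the sketch below derives the number of batches per epoch and the range of training items per epoch they imply. The function name `items_per_epoch_bounds` and the `steps_per_batch` parameter are hypothetical and not part of this repository. The default assumes one counted step per batch on a single device with no gradient accumulation. If the step counter advances once per optimizer (which can happen in GAN training with separate generator and discriminator optimizers), pass `steps_per_batch=2`.

```python
def items_per_epoch_bounds(total_steps, epochs, batch_size, steps_per_batch=1):
    """Return (batches_per_epoch, min_items, max_items) implied by the step count.

    steps_per_batch is an assumption about how the trainer counts steps:
    1 if each batch counts once, 2 if each of two optimizers counts separately.
    """
    steps_per_epoch, rem = divmod(total_steps, epochs)
    if rem:
        raise ValueError("total_steps is not a multiple of epochs")
    batches, rem = divmod(steps_per_epoch, steps_per_batch)
    if rem:
        raise ValueError("steps per epoch is not a multiple of steps_per_batch")
    # Upper bound: every batch is full.
    max_items = batches * batch_size
    # Lower bound: the last batch holds a single item.
    min_items = (batches - 1) * batch_size + 1
    return batches, min_items, max_items


# Figures quoted in this README: 612,000 steps, 1000 epochs, batch_size = 16.
print(items_per_epoch_bounds(612_000, 1_000, 16))                     # one step per batch
print(items_per_epoch_bounds(612_000, 1_000, 16, steps_per_batch=2))  # two steps per batch
```

Which of the two readings applies depends on how the training loop counts steps, which this README does not state.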

Some audio samples are in asset/sample/.

The implementation of iSTFTNet for comparison
