This is almost VITS but decoder is replaced with Vocos for performance.
Running run.sh will automatically download the data and begin training.
cd scripts
./run.sh
synthesize.sh uses last.ckpt by default, so if you want to use a specific weight, change it.
cd scripts
./synthesis.sh
pip install torch torchaudio lightning tqdm pandas matplotlib
WIP