About the SAN losses

sh-lee-prml opened this issue 10 months ago

Sang-Hoon Lee commented 10 months ago

Thanks for the nice work 👍

I'm now training my TTS model with BigVSAN in place of BigVGAN.

First, the model with BigVSAN shows slightly better results in early steps! 🚀🚀🚀

I have attached my TensorBoard graphs, and I was wondering whether you saw similar curves during your training.

Have you tried tuning the hyperparameters (weights) for the generator loss or the feature matching loss? The scales of these losses are quite different from those of the baseline model (BigVGAN).

Thanks again 😊
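To make the question about loss scales concrete, here is a minimal sketch of how the generator objective is typically assembled as a weighted sum, and how a weight could be chosen to bring one term back to a reference magnitude. The names `LossWeights`, `total_generator_loss`, and `rescale_to_match`, as well as all numeric values, are hypothetical and are not taken from the BigVSAN or BigVGAN code.

```python
from dataclasses import dataclass


@dataclass
class LossWeights:
    # Hypothetical weights; the values actually used by BigVSAN are not stated in this thread.
    adv: float = 1.0  # adversarial (generator) loss
    fm: float = 1.0   # feature matching loss
    mel: float = 1.0  # mel-spectrogram reconstruction loss


def total_generator_loss(adv_loss: float, fm_loss: float, mel_loss: float,
                         w: LossWeights) -> float:
    """Weighted sum of the three generator loss terms."""
    return w.adv * adv_loss + w.fm * fm_loss + w.mel * mel_loss


def rescale_to_match(reference_scale: float, observed_scale: float) -> float:
    """Weight that brings a term of magnitude `observed_scale` to `reference_scale`."""
    if observed_scale == 0:
        raise ValueError("observed_scale must be non-zero")
    return reference_scale / observed_scale


# Illustrative numbers only: a term that is 4x larger than in the baseline
# would need a weight of 0.25 to contribute at the same magnitude.
weights = LossWeights(adv=1.0, fm=rescale_to_match(2.0, 8.0), mel=1.0)
loss = total_generator_loss(adv_loss=2.0, fm_loss=8.0, mel_loss=0.5, w=weights)
```

Matching magnitudes this way is only a starting heuristic; whether it helps quality would still have to be checked empirically.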

Takashi Shibuya · Answer 1 · Thu Sep 21 2023 17:27:26 GMT+0800 (China Standard Time)

Thank you for your interest and sharing your learning curves!

That's a good question. There should be room for improvement in the hyperparameter settings. We didn't do any hyperparameter tuning; we simply compared BigVSAN and BigVGAN using the same hyperparameter values for both. We're certainly interested in how much the performance would improve with careful tuning, but our time is currently going to other things.

Sang-Hoon Lee · Answer 2 · Thu Sep 21 2023 18:55:45 GMT+0800 (China Standard Time)

Could you share your learning curves?

I hope to have a handle on training SAN 👀

Takashi Shibuya · Answer 3 · Thu Sep 21 2023 21:26:00 GMT+0800 (China Standard Time)

Here they are!

We didn't record a loss for each discriminator as you're doing. We only have the total losses summed over the multiple discriminators. The light blue curves are our BigVSAN, and the pink ones are our BigVGAN reproduction. I hope these are informative.
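For anyone who wants both views discussed above (one curve per discriminator plus the total), a minimal sketch of building the scalar tags follows. The function name `discriminator_loss_scalars`, the tag prefix, and the discriminator names are assumptions for illustration and do not come from either codebase.

```python
def discriminator_loss_scalars(per_disc_losses: dict, prefix: str = "loss_disc") -> dict:
    """Map per-discriminator losses to logging tags and add their sum as a total."""
    scalars = {f"{prefix}/{name}": float(value)
               for name, value in per_disc_losses.items()}
    # The total is what a run that only logs aggregate losses would record.
    scalars[f"{prefix}/total"] = float(sum(per_disc_losses.values()))
    return scalars


# Hypothetical discriminator names and values.
scalars = discriminator_loss_scalars({"disc_a": 1.5, "disc_b": 2.5})
```

Each entry could then be passed to a logger, for example `SummaryWriter.add_scalar(tag, value, step)` from `torch.utils.tensorboard`, so that per-discriminator and total curves appear side by side.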