About the SAN losses
sh-lee-prml opened this issue Β· comments
Thanks for nice work π
Now, I'm trainining my TTS model by replacing BigVGAN with BigVSAN.
First, the model with BigVSAN shows slightly better results in early steps! πππ
I have attached tensorboard graph and I was wondering if maybe you could have similar results during training.
Have you tried to tune the hyper-parameter for gen loss or feature matching loss? The scales of these losses are quite different from the baseline model (BigVGAN),
Thanks again π
Thank you for your interest and sharing your learning curves!
That's a good question. There should be room for improvement in hyperparameter setting. We didn't conduct any hyperparameter tuning, and just compared BigVSAN and BigVGAN, giving the same hyperparameter values. We're surely interested in how largely the performance will improve after elaborate hyperparameter tuning, but we're spending our time doing other things now.
Could you share your learning curves?
I hope to have a handle on training SAN π