Add link to GAN-TTS and GAN-VC in doc

Question

Add link to GAN-TTS and GAN-VC in doc

r9y9 opened this issue 7 years ago · comments

I'm currently exploring GANs in text-to-synthesis and voice conversion. When I finish up my work, I will add link to the repo in the doc, to demonstrate the library capability.

Reference: https://arxiv.org/abs/1709.08041

Ryuichi Yamamoto · Answer 1 · Fri Sep 29 2017 18:07:40 GMT+0800 (China Standard Time)

That will be the first example of minimum generation error (MGE) training using autograd module.

https://r9y9.github.io/nnmnkwii/latest/references/autograd.html

Ryuichi Yamamoto · Answer 2 · Mon Oct 02 2017 15:32:03 GMT+0800 (China Standard Time)

Progress:

MGE training for VC/TTS is working reasonably (as baseline), not significantly better than MSE loss though.
GAN-VS is working great as expected. GV is implicitly compensated and speech quality is greatly improved. Very nice!
Implmented GAN-TTS, but it is not working. I suspect adversarial loss for multi-stream features (mgc, lf0, vuv and bap) is difficult to optimize. I'm trying further experiments. MGE loss (mgc, lf0, vuv and bap) + ADV loss (mgc) may work?

Ryuichi Yamamoto · Answer 3 · Fri Oct 06 2017 10:43:12 GMT+0800 (China Standard Time)

https://github.com/r9y9/gantts

Ryuichi Yamamoto · Answer 4 · Mon Oct 09 2017 20:21:50 GMT+0800 (China Standard Time)

GAN-VC and GAN-TTS are now working great. Essential to actually make GAN-TTS work great is that do computing ADV loss with mgc (without 0-th) only, not all features.