Attention blank: is it because of my progressive training schedule?

Question

lilmadman007 opened this issue 3 years ago

My attention plot is blank after 10k steps, which shouldn't be normal.
I'm using the LJSpeech dataset.
This is the second time I have preprocessed everything and trained.

Loss is around 1.0 at 10k steps
Are my settings wrong here? Does this not work?

Thanks!

NOTE: I LOOKED AT THIS ISSUE ALREADY -> #154

Ollie McCarthy · Answer 1 · Mon Feb 01 2021 22:52:38 GMT+0800 (China Standard Time)

Hi, sometimes the alignment will fail randomly. I've never tried with batch size of 8 so that could be it. Maybe try finetuning on one of the pretrained models.

AhmadAlAmin21 · Answer 2 · Tue May 04 2021 08:01:31 GMT+0800 (China Standard Time)

did you ever solve this?

lilmadman007 · Answer 3 · Wed May 05 2021 00:21:07 GMT+0800 (China Standard Time)

> did you ever solve this?

Sorry for the lack of feedback. No, I did not. When fatchord commented that it fails sometimes, I tried again two more times, but it just didn't work. Maybe my GPU is just not good enough, like I said, but I moved on when I couldn't get results.
Any help would be appreciated anyway!

AhmadAlAmin21 · Answer 4 · Wed May 05 2021 00:30:04 GMT+0800 (China Standard Time)

I think I found a solution:

Increase "r" from 7 to 12 in the tts_schedule in hparams.py.
In models/tacotron.py, change line 200 from "scores = torch.sigmoid(u) / torch.sigmoid(u).sum(dim=1, keepdim=True)" to "scores = F.softmax(u, dim=1)".

got this from #154 (comment)
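
For anyone wondering what the second change does numerically, here is a minimal sketch that is not taken from the repo or from #154. It mirrors the two expressions for a single row of attention energies, using plain Python lists instead of tensors so it runs without PyTorch. The values in `u` are made up for illustration. Whether this sharper distribution is the actual reason alignment recovers is not confirmed in this thread.

```python
import math

def sigmoid_normalized(u):
    """One-row equivalent of torch.sigmoid(u) / torch.sigmoid(u).sum(...)."""
    s = [1.0 / (1.0 + math.exp(-x)) for x in u]
    total = sum(s)
    return [v / total for v in s]

def softmax(u):
    """One-row equivalent of F.softmax(u, dim=1)."""
    m = max(u)  # subtract the max for numerical stability
    e = [math.exp(x - m) for x in u]
    total = sum(e)
    return [v / total for v in e]

# Hypothetical attention energies over 5 encoder steps (not real model output).
u = [0.0, 1.0, 4.0, 1.0, 0.0]

smooth = sigmoid_normalized(u)
sharp = softmax(u)

print("sigmoid-normalized:", [round(v, 3) for v in smooth])
print("softmax:           ", [round(v, 3) for v in sharp])
```

Both versions produce weights that sum to 1, but because the sigmoid saturates at 1, the normalized-sigmoid scores stay comparatively flat, while softmax puts most of the weight on the largest energy.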