fatchord / WaveRNN

WaveRNN Vocoder + TTS

Home Page: https://fatchord.github.io/model_outputs/

About Predicted Features VS Ground Truth

bringtree opened this issue

Hello, thank you for sharing the code.
While studying it, I ran into a question.

image
As shown in the figure, it is better to use the predicted features during training. However, I found that the code uses the ground truth to compute the attention weights.

prenet_in = m[:, :, t - 1] if t > 0 else go_frame

prenet_out = self.prenet(prenet_in)

I would like to know why you use the ground truth.
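
For illustration, the two options being contrasted here can be sketched roughly as follows. This is a toy sketch and not the repository's code: `toy_decoder_step` and `decode` are hypothetical names, plain Python lists stand in for tensors, and the all-zero go frame is an assumption. Feeding the ground-truth frame from step t - 1 (which is how the post reads the quoted line) is usually called teacher forcing; feeding the model's own previous prediction is the free-running alternative.

```python
def toy_decoder_step(prev_frame):
    """Hypothetical stand-in for prenet + attention + decoder: halves each value."""
    return [0.5 * x for x in prev_frame]


def decode(target, teacher_forcing=True):
    """Run a toy decoder loop over `target`, a list of frames (lists of floats).

    teacher_forcing=True  -> step t is fed the ground-truth frame t - 1
    teacher_forcing=False -> step t is fed the prediction made at step t - 1
    """
    go_frame = [0.0] * len(target[0])  # assumed all-zero start frame
    outputs = []
    for t in range(len(target)):
        if t == 0:
            prev = go_frame
        elif teacher_forcing:
            prev = target[t - 1]   # ground truth, like m[:, :, t - 1]
        else:
            prev = outputs[t - 1]  # the model's own previous output
        outputs.append(toy_decoder_step(prev))
    return outputs


target = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
forced = decode(target, teacher_forcing=True)
free = decode(target, teacher_forcing=False)
```

In this toy setup the two modes diverge from the second step onward, because only the teacher-forced loop ever sees the target frames.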

I got it wrong.