ming024 / FastSpeech2

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Inconvergence in pitch and energy loss

zhoufqing opened this issue · comments

hi,
When using my own data for training, the pitch and energy loss did not converge, and the Mel loss decreased to 1. The sampling rate of my data is 44100. I have modified the sampling rate parameter in the preprocess.yaml file to 44100, and the other parameters have not been modified. After training for 100k, the pitch and energy loss did not converge. The final synthesized speech is also entirely noisy.

10万-44 1
10w-44 1