An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool
inconnu11 opened this issue 2 years ago · comments
Hi, is the pitch/energy normalized within corpus instead of within speaker? Would it be better within speaker?