Unofficial implementation of Megatts2
blackbird-fish opened this issue 5 months ago · comments
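Hi, so the prediction target is still the mel spectrogram, the VQGAN discretizes the speech prosody, and the PLM predicts the indices of the prosody codebook. Is that the right way to understand it?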
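
To make the question concrete: "discretizing prosody" can be pictured as a nearest-neighbour lookup in a codebook, and the resulting index sequence is what a language model over prosody codes would be trained to predict. The snippet below is only a toy sketch under that assumption. The function `quantize`, the codebook, and the frame values are all hypothetical and are not taken from this repo or from the paper.

```python
# Illustrative only: a toy vector-quantization lookup, not code from this repo.

def quantize(frames, codebook):
    """Map each frame vector to the index of its nearest codebook entry."""
    indices = []
    for frame in frames:
        # Squared Euclidean distance from this frame to every codebook vector.
        dists = [
            sum((f - c) ** 2 for f, c in zip(frame, code))
            for code in codebook
        ]
        # The index of the closest codebook vector is the discrete "prosody code".
        indices.append(dists.index(min(dists)))
    return indices


# Hypothetical 2-D prosody latents and a 3-entry codebook (made-up values).
codebook = [[0.0, 0.0], [1.0, 1.0], [2.0, 0.0]]
frames = [[0.1, -0.1], [0.9, 1.2], [1.8, 0.1]]

# One integer per frame; a sequence like this would be the PLM's prediction target.
indices = quantize(frames, codebook)
print(indices)
```

In a real VQGAN the frames would come from a learned prosody encoder and the codebook would be learned too; here both are hard-coded just to show the index lookup.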