Kyubyong / tacotron

A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

How to synthesize long sentences?

hadiasheri opened this issue · comments

This is a replication of this issue
I've trained tacotron with Nancy corpus for about 1M iterations.
The synthesized speech is great for sentences with maximum length of about 25 words.
However, when you feed a long sentence ( e.g length~50), it fails to produce acceptable speech because some parts of the sentences are missing or pronounced awfully.
Its seems that there might be some alignment failures.
I'd be grateful if you could share your ideas about this problem.