Rayhane-mamah / Tacotron-2

DeepMind's Tacotron-2 Tensorflow implementation

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Point of

harismeharis opened this issue · comments

I have a basic and probably stupid question. What is the point of using a WaveNet Vocoder to invert the mel spectrogram feature representation into time-domain waveform since we can just use audio.inv_mel_spectrogram and get the audio (voice output) directly from the encoder part?