ming024 / FastSpeech2

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

spectrogram image to wav

ahmeftah opened this issue · comments

Dear Authors and readers

I would appreciate it if you would give me an answer to my question:

Is it possible to convert the spectrogram image (not the array) to wav (reconstruct the wav audio from the spectrogram in image form)?

hi! Yes, it is possible, vocoder (hifigan) is stored in this repo and synthesize.py calls it. Don't forget to unzip hifigan/generator_LJSpeech.pth.tar.zip (or different, depends on what ckpt you want to use) before the inference and create all necessary folders (you can check what you need to mkdir from inference errors haha)

Thanks nestyme