How would I train a TTS model on music? So instead of it talking from a prompt, it makes music from a prompt.
breadbrowser opened this issue · comments
breadbrowser commented
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
breadbrowser opened this issue · comments