This is a demonstration of how to produce speech in a particular emotion from text. This is achieved by fine-tuning a TTS model on emotion-labelled speech data, formulating the task as a multi-modal problem.