LAION-AI / natural_voice_assistant

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

StyleTTS2 for german language

ichDaheim opened this issue · comments

A version of StyleTTS2 generating german language with good emphasis would be a giant leap forward in my opinion.
All the existing open source TTS systems for german are sounding somewhat robotic or indifferent. (no comparison to the english versions - as far as i can judge as a non native english speaking person).

Not sure because of the Licenses but maybe these are helpful ?
Mozilla Common Voice: (approximately 1300 hours of german audio)
https://commonvoice.mozilla.org/de/datasets

Thorsten-Voice:
https://github.com/thorstenMueller/Thorsten-Voice?tab=readme-ov-file#voice-datasets

Hey @ichDaheim, thanks for your post. You are right, mulit-language support is definitely a very important point and already part of our roadmap! StyleTTS2, which we are currently using as TTS system, unfortunately only supports english. We experimented with other TTS models which also supports other languages, but all of them either have a bad voice quality or are significantly slower than StyleTTS. We are currently working on a streamable TTS system than can start synthesizing with just a few text tokens look-ahead. This could drastically decrease the latency and we could then fine-tune it on other languages.

If you are interessted in contributing to this or know any open-source TTS that supports (token-by-token) streaming, I am very happy to chat about it in detail.