Implementation of SoundStorm built upon SpeechTokenizer.
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool
xinkez opened this issue 8 months ago · comments
Hi,
How can I get semantic_tokens from input text? Thank you in advance.
You can train a transformer to generate semantic tokens from input text like what audiolm/spear-tts do