ZhangXInFD / SpeechTokenizer

This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on

Home Page:https://0nutation.github.io/SpeechTokenizer.github.io/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Cross-lingual

coding-sharks opened this issue · comments

Hello, I used the checkpoint file you trained with librispeech to infer the Chinese audio and it still works well. Is that what you expected? Because your dataset doesn't seem to use Chinese, only English data.