mozilla / DeepSpeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

Chinese Mandarin Garbled

wingdi opened this issue

I am using the pre-trained model deepspeech-0.9.3-models-zh-CN.pbmm, and the text it generates is garbled whether I treat it as GBK or UTF-8.
Is there any way to fix this?

This is the generated JSON:
[Screenshot: QQ图片20210314104814]

When I print the word value,

word = '\udce5\udcb8\udce5\udce5\udcb9\udcb3\udce6\udcba\udca6\udce5\udc8f\udce6\udcb1\udc89\udce5\udc94'
print(word)

I get:
UnicodeEncodeError: 'utf-8' codec can't encode characters in position 0-15: surrogates not allowed
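
A possible way to inspect the value: every code point in the string above lies in the range U+DC80 to U+DCFF, which is the range Python's `surrogateescape` error handler uses to carry undecodable bytes inside a `str`. Assuming that is how the value was produced (the thread does not confirm this), the underlying bytes can be recovered and decoded leniently. The helper name `recover_text` is hypothetical and used only for illustration; this is a diagnostic sketch, not a fix for the model output.

```python
# Sketch only: assumes the lone surrogates came from a 'surrogateescape'
# decode somewhere upstream. `recover_text` is a hypothetical helper name.

def recover_text(word: str) -> tuple[bytes, str]:
    """Return the raw bytes hidden in `word` and a lenient UTF-8 decoding."""
    # 'surrogateescape' maps each U+DCxx code point back to the byte 0xxx.
    raw = word.encode("utf-8", errors="surrogateescape")
    # errors="replace" substitutes U+FFFD for incomplete or invalid sequences
    # instead of raising, so the decodable characters remain visible.
    text = raw.decode("utf-8", errors="replace")
    return raw, text


if __name__ == "__main__":
    word = ('\udce5\udcb8\udce5\udce5\udcb9\udcb3\udce6\udcba\udca6'
            '\udce5\udc8f\udce6\udcb1\udc89\udce5\udc94')
    raw, text = recover_text(word)
    print(raw.hex())    # the 16 bytes carried by the surrogates
    print(ascii(text))  # ASCII-safe view, avoids console encoding errors
```

For the string in this report, the recovered bytes contain some complete three-byte UTF-8 sequences and some that are cut short, so only part of the text decodes and the rest becomes U+FFFD. That would be consistent with bytes being dropped before the string reaches Python, though that is a guess rather than a confirmed diagnosis.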

OS: Windows 10; Python server; PyCharm (UTF-8)

Why was this issue closed? I didn't find a solution on Discourse, and I think this is a bug that needs to be fixed...

Hey there! Looks like Discourse is the right way to go for this :)