print(ds.stt(audio)) UnicodeEncodeError: 'utf-8' codec can't encode characters in position 558-559: surrogates not allowed

Question

dindaya opened this issue 3 years ago

Hi, I am using the DeepSpeech trained models "deepspeech-0.9.3-models-zh-CN.pbmm" and "deepspeech-0.9.3-models-zh-CN.scorer" for Mandarin inference on Ubuntu 18.04. Inference works, but for some audio files it fails with the error below.

Error:

    print(ds.stt(audio))
    UnicodeEncodeError: 'utf-8' codec can't encode characters in position 558-559: surrogates not allowed

I have not been able to fix this error; any help would be appreciated.

Regards,
Naval

lissyx · Answer 1 · Wed Sep 08 2021 18:29:32 GMT+0800 (China Standard Time)

For support and discussions, please use our Discourse forums.

iris · Answer 2 · Tue Dec 21 2021 14:56:24 GMT+0800 (China Standard Time)

So how can this be fixed? I can't find any question identical to this one. I ran into this problem when performing inference.
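
For reference, the message "surrogates not allowed" means the string passed to print() contains lone surrogate code points, which strict UTF-8 encoding rejects. A possible cause, stated here as an assumption and not confirmed anywhere in this thread, is that the transcript contained bytes that are not valid UTF-8 and they were decoded with the "surrogateescape" error handler. The sketch below uses only the Python standard library; printable() is a hypothetical helper name, and sample is a stand-in for a ds.stt(audio) result, not real model output.

```python
def printable(text: str) -> str:
    """Return a copy of text that can always be encoded as UTF-8."""
    try:
        # Turn surrogate-escaped code points (U+DC80..U+DCFF) back into
        # the raw bytes they stand for.
        raw = text.encode("utf-8", errors="surrogateescape")
    except UnicodeEncodeError:
        # Any other lone surrogate: replace each one with "?" instead.
        raw = text.encode("utf-8", errors="replace")
    # Decode again, substituting U+FFFD for invalid byte sequences.
    return raw.decode("utf-8", errors="replace")


# Stand-in for a transcript (assumption): two valid Mandarin characters
# followed by a truncated UTF-8 sequence, decoded with surrogateescape.
sample = b"\xe4\xbd\xa0\xe5\xa5\xbd\xe4\xbd".decode(
    "utf-8", errors="surrogateescape"
)

# sample.encode("utf-8") would raise UnicodeEncodeError here;
# the cleaned copy encodes without error.
cleaned = printable(sample)
encoded = cleaned.encode("utf-8")
```

Under that assumption, print(printable(ds.stt(audio))) would no longer raise, at the cost of showing a replacement character where the transcript was not valid UTF-8. This only hides the symptom; it does not explain why those audio files produce such output.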