Azure-Samples / cognitive-services-speech-sdk

Sample code for the Microsoft Cognitive Services Speech SDK

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Rate is not working for XiaoxiaoMultilingualNeural

jackfengji opened this issue · comments

Describe the bug

The rate in prosody is not changing the real speaking rate for XiaoxiaoMultilingualNeural voice. It's not working in voice gallery web ui either.

This is the SSML I'm using. It's still very slow.

<speak xmlns="http://www.w3.org/2001/10/synthesis" xmlns:mstts="http://www.w3.org/2001/mstts" xmlns:emo="http://www.w3.org/2009/10/emotionml" version="1.0" xml:lang="zh-CN"><voice name="zh-CN-XiaoxiaoMultilingualNeural"><lang xml:lang="zh-CN"><s /></lang><mstts:express-as style="Default"><prosody rate="+200.00%">今天是我在新公司的第一天,入职后很快投入了新的工作环境并开始熟悉各项事务</prosody></mstts:express-as><s /></voice></speak>

Hi @jackfengji thanks for your feedback!

I tried the SSML you provided, it works and speaks very fast compared with the default rate, would you mind checking that again, I have attached audio FYI
with rate.zip

@LinZhang-Support I found it's not working for eastasia region, I tried and us region and it's working.

Here're the exporeted files from voicegallery from eastasia and us region.

eastasia.zip
us.zip

@LinZhang-Support

Also when I'm trying to use a absolute number for rate, it's not working at all. But it's working for other voices.

<!--ID=B7267351-473F-409D-9765-754A8EBCDE05;Version=1|{"VoiceNameToIdMapItems":[{"Id":"390baec9-d867-4c01-bdcf-04e5848ee7dc","Name":"Microsoft Server Speech Text to Speech Voice (zh-CN, XiaoxiaoMultilingualNeural)","ShortName":"zh-CN-XiaoxiaoMultilingualNeural","Locale":"zh-CN","VoiceType":"StandardVoice"}]}--> <speak xmlns="http://www.w3.org/2001/10/synthesis" xmlns:mstts="http://www.w3.org/2001/mstts" xmlns:emo="http://www.w3.org/2009/10/emotionml" version="1.0" xml:lang="zh-CN"><voice name="zh-CN-XiaoxiaoMultilingualNeural"><lang xml:lang="zh-CN"><s /></lang><mstts:express-as style="Default"><prosody rate="2">今天是我在新公司的第一天,入职后很快投入了新的工作环境并开始熟悉各项事务</prosody></mstts:express-as><s /></voice></speak>

Hi @jackfengji this is a bit strange, I cannot reproduce both SSML in eastasia, the rate works fine my side, would you please try again
1.
<mstts:express-as style="Default">今天是我在新公司的第一天,入职后很快投入了新的工作环境并开始熟悉各项事务</mstts:express-as>

今天是我在新公司的第一天,入职后很快投入了新的工作环境并开始熟悉各项事务

This item has been open without activity for 19 days. Provide a comment on status and remove "update needed" label.