Picovoice / cheetah

On-device streaming speech-to-text engine powered by deep learning

Home Page: https://picovoice.ai/

Question on custom keywords/boost

tempo-riz opened this issue · comments

[Image: Untitled (1)]

What is the behavior here?

We can see that "a fire ball" and "a-fire-ball" have the same IPA. Does the spelling change anything else, like the speed of pronunciation? I'm not an expert on the topic :')
I would prefer to use the form with dashes (-) for all my custom words that contain spaces. Is that okay, or is it bad practice?

I tested adding them and boosting them (not at the same time) to see the difference, but the results seem inconsistent...

@tempo-riz, your observations are correct. We treat "a fire ball" exactly the same as "a-fire-ball" - there is no difference in expectations of how it will be pronounced. If you would like one to be differentiated, you can change the custom pronunciation, but other than that it would come down to which word makes the most sense in the context of the surrounding speech.

So just to be sure: even if I add/boost "a-fire-ball", the model output will probably be "a fire ball", right? Because that would make more sense in common speech, I guess.

Yes, that's correct. In this case it would not be beneficial to have both unless you set two different pronunciations.
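
For anyone checking their own custom word list for this situation, here is a minimal sketch that groups entries by pronunciation so redundant spellings stand out. It is plain Python and does not call any Picovoice API. The function name `find_shared_pronunciations`, the dictionary layout, and the IPA strings are all assumptions made up for illustration; they are not taken from the screenshot or from the Picovoice Console.

```python
from collections import defaultdict


def find_shared_pronunciations(custom_words):
    """Group spellings that map to the same pronunciation string.

    `custom_words` is a hypothetical mapping of spelling -> IPA string.
    Returns only the pronunciations shared by more than one spelling.
    """
    groups = defaultdict(list)
    for spelling, ipa in custom_words.items():
        # Collapse runs of whitespace so "a  b" and "a b" compare equal.
        key = " ".join(ipa.split())
        groups[key].append(spelling)
    return {ipa: names for ipa, names in groups.items() if len(names) > 1}


# Illustrative entries only; the IPA strings are placeholders.
custom_words = {
    "a fire ball": "ə faɪɚ bɔl",
    "a-fire-ball": "ə faɪɚ bɔl",
    "magic missile": "mædʒɪk mɪsəl",
}

shared = find_shared_pronunciations(custom_words)
for ipa, spellings in shared.items():
    print(f"{ipa}: {spellings}")
```

Any group this reports contains entries that, per the answer above, would be treated the same unless one of them is given a different custom pronunciation.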