mozilla / DeepSpeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

Repository from GitHub: https://github.com/mozilla/DeepSpeech

How can I get syllables or phonemes with DeepSpeech?

tiennguyen12g opened this issue · comments

Hello everyone, I am looking for a way to get the syllables of each recognized word, along with a per-word confidence score.
If you have any experience with this, please point me in the right direction.
😘😘Thank you so much.

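DeepSpeech, developed by Mozilla, converts speech to text but does not output syllables or phonemes: its acoustic model emits characters, not phonetic units. For confidence, the metadata API (`sttWithMetadata` in the Python bindings) returns a confidence value for each candidate transcript along with per-character timing, but not a per-word score. For syllables and phonemes, post-process the transcript with a pronunciation resource such as the CMU Pronouncing Dictionary, which NLTK ships as `nltk.corpus.cmudict`. PyDictionary, by contrast, is aimed at meanings and synonyms and is not a dependable source of syllable data.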
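As a rough illustration of pairing a transcript with a pronunciation dictionary, here is a minimal sketch. It assumes the transcript is already a plain string (for example, the return value of DeepSpeech's `stt`). The small `SAMPLE_PRONUNCIATIONS` table is a hand-copied stand-in in CMU Pronouncing Dictionary (ARPAbet) format; the helper names `phonemes_for`, `count_syllables`, and `annotate_transcript` are hypothetical and not part of DeepSpeech or NLTK.

```python
# Stand-in for the CMU Pronouncing Dictionary (assumption: a few sample
# entries copied by hand). With NLTK installed, the full dictionary could be
# loaded instead via nltk.download("cmudict") and nltk.corpus.cmudict.dict(),
# which returns the same word -> list-of-pronunciations shape.
SAMPLE_PRONUNCIATIONS = {
    "hello": [["HH", "AH0", "L", "OW1"]],
    "speech": [["S", "P", "IY1", "CH"]],
    "recognition": [["R", "EH2", "K", "AH0", "G", "N", "IH1", "SH", "AH0", "N"]],
}


def phonemes_for(word, pron_dict=SAMPLE_PRONUNCIATIONS):
    """Return the first listed pronunciation for a word, or None if unknown."""
    entries = pron_dict.get(word.lower())
    return entries[0] if entries else None


def count_syllables(phonemes):
    """Count syllables in an ARPAbet phoneme list.

    In ARPAbet, every vowel carries a trailing stress digit (0, 1, or 2),
    so the number of digit-suffixed phonemes equals the syllable count.
    """
    return sum(1 for p in phonemes if p[-1].isdigit())


def annotate_transcript(transcript, pron_dict=SAMPLE_PRONUNCIATIONS):
    """Attach phonemes and a syllable count to each word of a transcript."""
    annotated = []
    for word in transcript.split():
        phones = phonemes_for(word, pron_dict)
        annotated.append({
            "word": word,
            "phonemes": phones,
            "syllables": count_syllables(phones) if phones else None,
        })
    return annotated


if __name__ == "__main__":
    # The string below stands in for text returned by a speech-to-text call.
    for item in annotate_transcript("hello speech recognition"):
        print(item)
```

Words missing from the dictionary come back with `None` for both fields, so out-of-vocabulary handling (for example, a rule-based syllabifier) is left to the caller. Note that this yields phonemes for the dictionary pronunciation of each word, not for what the speaker actually said.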