ccoreilly / LocalSTT

Android Speech Recognition Service using Vosk/Kaldi and Mozilla DeepSpeech

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Small scorer file

ellenhp opened this issue · comments

Hey! Very excited to see this. It seems to still build and function in 2024. I had to update the version of the vosk dependency to 0.3.47 to get it to build. For DeepSpeech/English, I had to replace the model files with the 0.8.2 deepspeech releases and add org.gradle.jvmargs=-Xmx16g to gradle.properties to let gradle build with the massive multilingual .scorer file. After that I switched up the paths in app/src/main/java/cat/oreilly/localstt/DeepSpeechRecognitionService.java and it works. I did have to use ndk 20.1.5948944, it wouldn't work with the latest NDK.

I was wondering how you got the very small .scorer file for Catalan? Is there a script somewhere I could run to extract single languages from the big scorer file? Or did you generate your own? It looks like there might be ways to do that, but I haven't looked into it too closely yet. The scorer file from the DeepSpeech release seems to work with the little bits of Spanish I know, so I'm guessing it works with every language that DeepSpeech supports. No wonder it's so big! 😅

Thank you so much for publishing this. It's really helpful.