Small scorer file
ellenhp opened this issue · comments
Hey! Very excited to see this. It seems to still build and function in 2024. I had to update the version of the vosk dependency to 0.3.47 to get it to build. For DeepSpeech/English, I had to replace the model files with the 0.8.2 deepspeech releases and add org.gradle.jvmargs=-Xmx16g to gradle.properties to let gradle build with the massive multilingual .scorer file. After that I switched up the paths in app/src/main/java/cat/oreilly/localstt/DeepSpeechRecognitionService.java and it works. I did have to use NDK 20.1.5948944; it wouldn't work with the latest NDK.
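For anyone following along, the heap change mentioned above would look roughly like this. It is only a sketch: it assumes gradle.properties sits at the project root, and the comments are mine rather than anything from the repo.

```properties
# gradle.properties (assumed location: project root)
# Raise the Gradle daemon's maximum JVM heap to 16 GB so the build
# can get through packaging the large .scorer file.
org.gradle.jvmargs=-Xmx16g
```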
I was wondering how you got the very small .scorer file for Catalan? Is there a script somewhere I could run to extract single languages from the big scorer file? Or did you generate your own? It looks like there might be ways to do that, but I haven't looked into it too closely yet. The scorer file from the DeepSpeech release seems to work with the little bits of Spanish I know, so I'm guessing it works with every language that DeepSpeech supports. No wonder it's so big! 😅
Thank you so much for publishing this. It's really helpful.