domcross / german-stt-evaluation

Evaluation of STT models for german language

Home Page:https://domcross.github.io/german-stt-evaluation/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Jaco-Assistant / Scribosermo setup?

fquirin opened this issue Β· comments

Hi Dominik,

these are some nice tests πŸ‘ . I was wondering if you have any documentation on how you got Scribosermo running? It looks like performance and size are very good and I'd love to add it to the SEPIA STT-Server, but I have some difficulties getting started (maybe this could be an entry point?).

Have you ever tried running it on a Raspberry Pi?

Oh and btw did you do some tests with Coqui as well?

Hi Florian,

I was wondering if you have any documentation on how you got Scribosermo running?

I plan to publish the code-snippets I have used for evaluation soon.

Have you ever tried running it on a Raspberry Pi?

Not yet, might give it a try on a RPI3 in the future...

Oh and btw did you do some tests with Coqui as well?

The tflite-version of DeepSpeech-German-v0.9.0 model was tested with Coqui-STT as well, but the results were not too good (WER: 33.74, CER: 12.54).

Thanks for the info, I'll keep an eye out for updates πŸ™‚

Not yet, might give it a try on a RPI3 in the future...

I'd be very interested in the results! Vosk runs smoothly on RPi4 and I'm already very happy with the performance (especially when you train your own LMs), but I'd like to offer a broader range of engines and Scribosermo looks really impressive in your tests. The RTF gives me hope that it might work on RPis as well ^^

Hi @domcross , I saw your update and decided to give it a try on Raspberry Pi OS aarch64 version.

Do you maybe happen to have a Wheel for 'ds_ctcdecoder' that runs on Python 3.7 aarch64 :sweat_smile: ... I saw your Python 3.6 version and tried to downgrade the Pi but v3.6 is giving me a lot of other headaches :see_no_evil:

A cp37/armv7-wheel for RPI3 can be found here, maybe this works for RPI4/aarch64 as well?
https://gitlab.com/Jaco-Assistant/Scribosermo/-/tree/master/extras/misc

Ty, I've seen that folder and noticed there might be a build script as well, but unfortunately the 32bit Wheels won't work on aarch64 :-(

I will use the additional hour we gain through time savings tonight by trying to build a wheel for cp37/aarch64...

:star_struck: gl :grinning:

pip installation worked πŸ₯³ thanks a lot!
working on the remaining packages now ^^
I'm building wheels for 'numpy' and 'tokenizers' (which requires Rust compiler) right now. Next stop will be librosa 🀞

A little update: It works very well on Raspberry Pi4 2GB aarch64 πŸ˜€

I've collected all my tests here: https://github.com/fquirin/scribosermo-stt-setup

There is one thing that confuses me though. Do you understand the license of the model checkpoints? It seems Daniel took training weights for QuartzNet from Nvidia's NGC page which means the terms of usage apply and probably prohibit usage on certain systems or under certain conditions which are unclear to me :-/

I wasn't aware that the training weights were taken vom Nvidia.
To avoid any legal/license issues I did not include the pretrained models on purpose ;-)

Smart move ;-) ... I didn't dare to host them in the Git repo either :-(, hope we can resolve this issue somehow.

Closing as issue is resolved.