javascript kaldi-gstreamer-server recorderjs speech-recognition speech-to-text

dictate.js

dictate.js is a small Javascript library for browser-based real-time speech recognition. It uses Recorderjs for audio capture, and a WebSocket connection to the Kaldi GStreamer server for speech recognition.

API

The API is modeled after Android's SpeechRecognizer. See the source code of lib/dictate.js and the usage in demos/demo.js.

Running the demos

The demos connect to the public services running on wss://bark.phon.ioc.ee that offer Estonian and English speech recognition.

The demos are available here. (Note that in order to use a wss-service the HTML-pages must be loaded over https.)

To run the demos on localhost, start a local HTTP server, e.g.:

python3 -m http.server 8081

and then open e.g. http://localhost:8081/demos/mob.html.

Browser support

Known to work in:

Mozilla Firefox 73.0.1 on Ubuntu 19.10
Chrome 80 on Android 10
Opera 56.1 on Android 10
various iOS and Windows devices (see #27)

Issues:

Google Chrome 81.0 on Ubuntu 19.10 (WebSocket connection to 'wss://...' failed: Error in connection establishment: net::ERR_SSL_OBSOLETE_VERSION)

About

A small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket connection to the Kaldi GStreamer server for speech recognition.

https://kaljurand.github.io/dictate.js/

javascript kaldi-gstreamer-server recorderjs speech-recognition speech-to-text

BSD 3-Clause "New" or "Revised" License

Languages

Language:JavaScript 100.0%

Kaljurand / dictate.js

dictate.js

API

Running the demos

Browser support

See also

About

Languages