Experimenting with automatic WASM based machine learning translation of videos.
Contains a basic UI and a framework to load videos into it. Detecting audio in the video will produce a stream of utterances next to the video.
Based on the https://github.com/ccoreilly/vosk-browser implementation of the Vosk machine learning models in WASM.
Use it online here: https://captioner.richardson.co.nz/