WebOCR

Minimalistic OCR from camera devices. Recognize and copy text from photos made on your mobile device or any other device with camera access. This solution uses TesseractJS as OCR system.

demo

visit https://nevolin.be/webocr/
allow camera access
aim at some text and click the button
wait a few seconds for the image to be processed
the detected text will be shown below

notes

The OCR system does a pretty good job, especially with numbers and special characters. However it's definitely not perfect and can produce inaccuracies.

installation

You need NodeJS v12+.

Clone or download this repository, then execute npm install to download dependencies.

Use node server.js to launch the service script.

To keep it running 24/7 use a library like PM2 and execute: pm2 start ecosystem.config.js .

There are two ways to carry out OCR: client-side and server-side (default). Client-side OCR runs in the browser, it is much slower but could be tweaked using more workers. For these configurations you should consult TesseractJS's API docs. To enable client-side OCR use the function localProcessImg() instead of serverProcessImg() inside /public/main.js.