WebOCR
Minimalistic OCR from camera devices. Recognize and copy text from photos made on your mobile device or any other device with camera access. This solution uses TesseractJS as OCR system.
demo
- visit https://nevolin.be/webocr/
- allow camera access
- aim at some text and click the button
- wait a few seconds for the image to be processed
- the detected text will be shown below
notes
The OCR system does a pretty good job, especially with numbers and special characters. However it's definitely not perfect and can produce inaccuracies.
installation
You need NodeJS v12+.
Clone or download this repository, then execute npm install
to download dependencies.
Use node server.js
to launch the service script.
To keep it running 24/7 use a library like PM2 and execute: pm2 start ecosystem.config.js
.
devs
There are two ways to carry out OCR: client-side and server-side (default).
Client-side OCR runs in the browser, it is much slower but could be tweaked using more workers. For these configurations you should consult TesseractJS's API docs. To enable client-side OCR use the function localProcessImg()
instead of serverProcessImg()
inside /public/main.js
.
screenshots
Contact
For enquiries or issues get in touch with me:
Name: Ilya Nevolin
Email: ilja.nevolin@gmail.com