machine-learning ocr deep-learning cnn craft keras-tensorflow neural-network convolutional-neural-networks emnist iam line-segmentation word-segmentation character-segmentation

OCR-Dev

A small computer vision project in the making. The goal is to create a simple website that implements CNN Keras model to recognize handwritten characters (letters and digits) from an input file (a photo, image, or a scanned file). Beside CNN Keras, we are also using CRAFT (Character-Region Awareness For Text detection) in Pytorch. The point is to run our model on seperate character.

Progress

We used the popular MNIST dataset with over 60k images of handwritten digits to practice using CNN in Keras.
Developed a more complexed CNN model on EMNIST (Extended-MNIST) dataset that returns 91.30% accuracy for EMNIST Bymerge and 88.20% for EMNIST Byclass.
Current models are performing low accuracy on non-EMNIST testing dataset.
Using CRAFT to extract character images from IAM dataset as a data augmentation method.
Training a new model on the new dataset.
Implementing prebuilt tools to detect lines in texts.
Implementing prebuilt tools to convert italic texts to straight texts.

Partners

Minh Quan Huynh: https://github.com/hmq1812
Duc Minh Hoang: https://github.com/minhduchoang301/OCR-community-website

Updating

Please click on the folders to see new updates. The project is still in the process.

About

A small computer vision project in the making. Partners: Minh Quan Huynh and Duc Minh Hoang.

machine-learning ocr deep-learning cnn craft keras-tensorflow neural-network convolutional-neural-networks emnist iam line-segmentation word-segmentation character-segmentation

Apache License 2.0

Languages

Language:Jupyter Notebook 99.7%Language:Python 0.3%