RaviTeja51 / EATEN_OCR

This repo contains a TensorFlow implementation of the paper EATEN: Entity-aware Attention for Single Shot Visual Text Extraction


Eaten-OCR

This repo contains a TensorFlow implementation of the paper EATEN: Entity-aware Attention for Single Shot Visual Text Extraction, a single-shot text extraction model for extracting Entities of Interest. The data mentioned in the paper can be found here. The paper uses three datasets: train tickets, business cards, and passports. Because of limited computational resources, the code in this repo is written for a subset of the train ticket dataset; however, it can be extended to all three datasets with small changes to the code.

Note: This implementation does not include the label smoothing mentioned in the paper.
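For readers who want to experiment with adding it, here is a minimal sketch of label smoothing in its common uniform form, written in plain Python so it runs without the model. The function names `smooth_labels` and `cross_entropy` and the value `epsilon=0.1` are assumptions made for illustration; they are not taken from this repo or from the paper.

```python
import math


def smooth_labels(target_index, num_classes, epsilon=0.1):
    """Build a smoothed target distribution for one decoding step.

    Hypothetical helper: epsilon=0.1 is an assumed default, not a value
    taken from the paper or from this repo.
    """
    if not 0 <= target_index < num_classes:
        raise ValueError("target_index must be in [0, num_classes)")
    # Spread epsilon uniformly over all classes ...
    dist = [epsilon / num_classes] * num_classes
    # ... and give the remaining (1 - epsilon) mass to the true class.
    dist[target_index] += 1.0 - epsilon
    return dist


def cross_entropy(target_dist, logits):
    """Cross-entropy between a target distribution and softmax(logits)."""
    # Numerically stable log-sum-exp for the softmax normaliser.
    max_logit = max(logits)
    log_norm = max_logit + math.log(sum(math.exp(x - max_logit) for x in logits))
    return -sum(t * (x - log_norm) for t, x in zip(target_dist, logits))


if __name__ == "__main__":
    logits = [2.0, 0.5, -1.0, 0.1]  # made-up scores for a 4-symbol vocabulary
    hard = smooth_labels(0, 4, epsilon=0.0)
    soft = smooth_labels(0, 4, epsilon=0.1)
    print("hard-label loss:", cross_entropy(hard, logits))
    print("smoothed loss:  ", cross_entropy(soft, logits))
```

In a TensorFlow training loop with one-hot targets, the same effect may be available through the `label_smoothing` argument of `tf.keras.losses.CategoricalCrossentropy`; whether that fits the loss used in this repo is not confirmed here.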

How to use?

Clone the git repo, download the CNN encoder, entity decoders, and tokenizers from here, and then run gui.py.

User Interface

GUI

Image selection

Inference




Languages

Language: Python 100.0%