ArnabBanik-repo / flyer-information-extractor

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

CardToExcel

A web application that lets you automate guest lecture information extraction from invitation cards

Installation

In order to run CardToExcel you will need the following dependencies

  • Python3
  • flask using pip3
  • openpyxl using pip3
  • Pillow using pip3
  • Tesseract-OCR
  • pytesseract using pip3

Instructions

  • Create an empty directory and cd into it
mkdir CardToExcel 
cd CardToExcel
  • Clone the repository
git clone --depth=1 https://github.com/ArnabBanik-repo/flyer-information-extractor/ .
  • Install Tesseract-OCR on your system For windows: Save it in C:\Program Files (x86)

  • Install dependencies

  ./bin/pip3 install flask openpyxl pytesseract
  • Run the server
  ./bin/python3 server.py
  • Get sample Invitation Flyer images from Dataset_generator/dataset/images folder

In order to generate more Flyer samples, cd into the Dataset_generator directory and run the dataset_generator

  ./bin/python3 dataset_generator.py

Authors

About


Languages

Language:Python 51.0%Language:PowerShell 25.3%Language:HTML 15.0%Language:Shell 8.8%