omarbakker / digibill

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

digibill

TODO

Line/Word Detection

  • MSER segmentation
  • Kmeans clustering
  • Kmeans++ clustering
  • Hierarchical Clustering
  • Stopping criteria for hierarchical clustering
  • DBSCAN (most likely to yield reasonable results)
  • OPTICS
  • findCountours from openCV (see dzone article)

Dataset Generation

  • Generate images of text
  • Generate images of text with a given font (color, size, type, bold/italics/underline)
  • Find an appropriate list of fonts to use
  • Find an appropriate list of transformations to make data realistic
  • Find a corpus (UPC database)
  • Generate the synthetic dataset using the above steps

Design/write the training procedure

  • CNN to extract features from word images
  • Bi-LSTM takes sequence of CNN extracted features
  • CTC classifies
  • Batch norm where appropriate

Important links

About


Languages

Language:Python 60.8%Language:C++ 22.8%Language:Makefile 12.5%Language:CMake 3.9%