Ryan315 / TextGenerator

OCR dataset Text-Detection dataset Font-Classification dataset generator

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

TextGenerator

  • This is a tool for ocr dataset, text detection, fonts classification dataset generate.

Functions:

  • different font, size, color, rotation
  • multithreading
  • specific layout
  • find smooth area to attach
  • character-wised annotation

image generated:

image attach:

rotation

bounding box of single word

```
# step 1
pip install requirements.txt
# step 2
sh make.sh
```
  • config file config.yml

  • run script

    python3 run.py
    
  • data generated config.yml -> provider> layout> out_put_dir

content modified

  1. np.int related
  2. check the piplist.txt for package version

todo:

Math generation and manipulation for formula detection task

About

OCR dataset Text-Detection dataset Font-Classification dataset generator

License:MIT License


Languages

Language:Python 92.2%Language:Cython 7.6%Language:Shell 0.1%