Amr-Aboshama / XGeN

An automated Exam Generator using Natural Language Processing Techniques.

OCR needs to define a minimum text size on the pdf

AymanAzzam opened this issue

When the text is very small, as in reference sections, the OCR can't detect the spaces between characters, so adjacent words run together.

We will use word segmentation.

Recommended solution: pick random pages and check the average word size; if it is greater than 6, apply word segmentation.
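
A minimal Python sketch of the check described above, not the project's actual implementation. It assumes the OCR output is available as a list of page strings, that "word size" means length in characters, and that the sampled pages are pooled before averaging. The names `average_word_length`, `needs_segmentation`, `segment`, and the `sample_size` and `vocabulary` parameters are hypothetical. The segmentation step is a simple dictionary-based split, used here only for illustration.

```python
import random


def average_word_length(text):
    """Mean length, in characters, of whitespace-separated tokens."""
    words = text.split()
    if not words:
        return 0.0
    return sum(len(w) for w in words) / len(words)


def needs_segmentation(pages, sample_size=5, threshold=6.0, seed=None):
    """Sample random pages and flag the document if words look merged.

    Assumption: `pages` is a list of OCR'd page strings, and the
    threshold of 6 from the issue is measured in characters.
    """
    if not pages:
        return False
    rng = random.Random(seed)
    sample = rng.sample(pages, min(sample_size, len(pages)))
    return average_word_length(" ".join(sample)) > threshold


def segment(token, vocabulary):
    """Split a merged token into vocabulary words, using the fewest pieces.

    Returns [token] unchanged when no full split exists.
    """
    n = len(token)
    best = [None] * (n + 1)  # best[i] = best split of token[:i]
    best[0] = []
    for end in range(1, n + 1):
        for start in range(end):
            piece = token[start:end]
            if best[start] is not None and piece in vocabulary:
                candidate = best[start] + [piece]
                if best[end] is None or len(candidate) < len(best[end]):
                    best[end] = candidate
    return best[n] if best[n] is not None else [token]


if __name__ == "__main__":
    pages = ["thecatsat onthemat", "the cat sat on the mat"]
    vocabulary = {"the", "cat", "sat", "on", "mat"}
    if needs_segmentation(pages[:1]):
        for token in pages[0].split():
            print(segment(token, vocabulary))
```

Sampling only a few pages keeps the check cheap on long PDFs, at the cost of possibly missing a small-text section such as the references if it is not among the sampled pages.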