OCR needs to define a minimum text size on the pdf
AymanAzzam opened this issue · comments
Ayman Azzam commented
when the text size is so small as in references, the OCR can't detect the spaces that between characters.
Ayman Azzam commented
We will use word segmentation.
Recommended Solution: We will pick random pages then check if the average word size > 6, do word segmentation.