There are 0 repository under scanned-image-pdfs topic.
Extract tables from scanned image PDFs using Optical Character Recognition.
BoxDetect is a Python package based on OpenCV which allows you to easily detect rectangular shapes like character or checkbox boxes on scanned forms.
This batch script creates a searchable PDF of a PDF with one or more scanned pages which contain images.
Debian packaging of pdfbeads
Growing collection of scripts that manipulate text data.