WZBSocialScienceCenter / pdftabextract

A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.

Home Page:https://datascience.blog.wzb.eu/2017/02/16/data-mining-ocr-pdfs-using-pdftabextract-to-liberate-tabular-data-from-scanned-documents/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

WZBSocialScienceCenter/pdftabextract Stargazers