A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.
Home Page:https://datascience.blog.wzb.eu/2017/02/16/data-mining-ocr-pdfs-using-pdftabextract-to-liberate-tabular-data-from-scanned-documents/
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool