ad-freiburg / pdfact

A basic tool that extracts the structure from the PDF files of scientific articles.

Cannot extract keywords and abstract from many PDF articles

tmbahadar opened this issue

Which dataset are you using for experimentation?

PDFAct has trouble finding the bounding boxes in some PDF files that use a two-column layout, for example:
ICEIS_2015_167-xml-out.pdf