JonathanLink / PDFLayoutTextStripper

Converts a PDF file into a text file while preserving the layout of the original PDF. This is useful, for instance, for extracting the content of a table in a PDF file. It is a subclass of the PDFTextStripper class from the Apache PDFBox library.
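To make the idea of layout-preserving extraction concrete, here is a minimal, self-contained sketch in plain Java. It is not the PDFLayoutTextStripper implementation and does not use PDFBox: the `LayoutSketch` class, the `Fragment` type, and the `CHAR_WIDTH` and `LINE_HEIGHT` constants are all hypothetical names and values chosen for illustration. It only shows one plausible way to map positioned text fragments onto a character grid so that columns line up in the plain-text output.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Illustrative sketch only: NOT the PDFLayoutTextStripper implementation.
final class LayoutSketch {

    // Hypothetical stand-in for a piece of text with a position on the page.
    static final class Fragment {
        final String text;
        final double x; // horizontal position in points, from the left edge
        final double y; // vertical position in points, from the top edge

        Fragment(String text, double x, double y) {
            this.text = text;
            this.x = x;
            this.y = y;
        }
    }

    // Assumed fixed character width and line height in points. A real page
    // would need values derived from the fonts actually in use.
    static final double CHAR_WIDTH = 6.0;
    static final double LINE_HEIGHT = 12.0;

    // Places each fragment on a character grid: y selects the output row,
    // x selects the output column, and gaps are padded with spaces.
    static String render(List<Fragment> fragments) {
        List<StringBuilder> rows = new ArrayList<>();
        for (Fragment f : fragments) {
            int row = (int) Math.round(f.y / LINE_HEIGHT);
            int col = (int) Math.round(f.x / CHAR_WIDTH);
            while (rows.size() <= row) {
                rows.add(new StringBuilder());
            }
            StringBuilder line = rows.get(row);
            while (line.length() < col) {
                line.append(' ');
            }
            // Append at the column, overwriting any characters already there.
            int end = Math.min(line.length(), col + f.text.length());
            line.replace(col, end, f.text);
        }
        return String.join("\n", rows);
    }

    public static void main(String[] args) {
        List<Fragment> cells = Arrays.asList(
                new Fragment("Name", 0, 0),
                new Fragment("Qty", 60, 0),
                new Fragment("Apple", 0, 12),
                new Fragment("3", 60, 12));
        // Both rows print with the second column starting at character 10.
        System.out.println(render(cells));
    }
}
```

Because horizontal gaps become runs of spaces rather than being collapsed, table cells stay in their columns, which is the property that makes this kind of output convenient for extracting tabular data.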

Home Page: https://jonathanlink.ch/PDFLayoutTextStripper.html

Repository on GitHub: https://github.com/JonathanLink/PDFLayoutTextStripper
