rafaelgrecco / extract_table_from_pdf

This repository contains a personal project developed while studying the pdfplumber library.

License: GPLv3. Contributions welcome.

Extract Tables from PDF Files

This application grew out of studying the pdfplumber library. It is simple to use and helps people who work with PDF files every day and need to extract tables from them for work, college, or other activities. Improvements are welcome. This is still the first version, so it may fail on some files.

Before using it, look at the model PDF in the test folder and make sure your PDF file follows the same layout. The model used to build the application can be found in the official GitHub repository of the pdfplumber library.

Using the application

Using it is simple: run the following command from the folder that contains the main.py file:

python3 main.py --path *

Here * is a placeholder for the directory where the PDF file is stored.
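The README does not show what main.py does internally, so the following is only a minimal sketch of how a script with a --path option might extract tables with pdfplumber. The helper names (find_pdfs, clean_table, extract_tables, save_csv) and the choice to write each table to a CSV file are assumptions made for illustration, not the project's actual code. Only pdfplumber.open and page.extract_tables come from the pdfplumber library itself.

```python
import argparse
import csv
from pathlib import Path


def find_pdfs(directory):
    """Return the PDF files found directly inside `directory`, sorted by name."""
    return sorted(p for p in Path(directory).iterdir() if p.suffix.lower() == ".pdf")


def clean_table(table):
    """Replace empty (None) cells with "" and strip surrounding whitespace."""
    return [[(cell or "").strip() for cell in row] for row in table]


def extract_tables(pdf_path):
    """Collect every table pdfplumber detects in the file, page by page."""
    import pdfplumber  # third-party dependency: pip install pdfplumber

    tables = []
    with pdfplumber.open(pdf_path) as pdf:
        for page in pdf.pages:
            # extract_tables() returns a list of tables; each table is a list of rows
            for table in page.extract_tables():
                tables.append(clean_table(table))
    return tables


def save_csv(table, out_path):
    """Write one table (a list of rows) to a CSV file (assumed output format)."""
    with open(out_path, "w", newline="", encoding="utf-8") as handle:
        csv.writer(handle).writerows(table)


def main(argv=None):
    parser = argparse.ArgumentParser(description="Extract tables from PDF files")
    parser.add_argument("--path", required=True, help="directory containing the PDF file(s)")
    args = parser.parse_args(argv)

    for pdf_path in find_pdfs(args.path):
        for index, table in enumerate(extract_tables(pdf_path), start=1):
            # Hypothetical naming scheme: <pdf name>_table<N>.csv next to the PDF
            out_path = pdf_path.with_name(f"{pdf_path.stem}_table{index}.csv")
            save_csv(table, out_path)
            print(f"{pdf_path.name}: wrote {out_path.name}")

# In a real script, finish with the usual guard:
#     if __name__ == "__main__":
#         main()
```

With that guard in place, the sketch would be called the same way as above, for example python3 main.py --path ./test. Keeping the pdfplumber import inside extract_tables means the file-handling helpers can be tried out even before the library is installed.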

Author

Rafael Messias Grecco

Electrical engineering student

Electrical Engineering student and technology enthusiast with projects in the fields of Data Science, Machine Learning, Deep Learning, Computer Vision and Time Series.

