Extract Tables from Pdf Files

This application emerged from the studies of the pdfplumber library. Its operation is extremely simple and is very useful to help people who deal daily with pdf files and need to extract tables from these files in their work, college or other activity. Improvement modifications are welcome, it's still in its first version, so it might malfunction with some files.

For use, see the model, available in the test folder, ensure that your pdf file is in this way. The model used to build the application can be found on the official github of the pdfplumber library.

Using the application

To use it is extremely simple, just write the following line of code in the folder that contains the main.py file:

python3 main.py --path *

Where * represents the directory where the pdf file is stored

Author

Rafael Messias Grecco

_{*Electrical engineering student}

Electrical Engineering student and technology enthusiast with projects in the fields of Data Science, Machine Learning, Deep Learning, Computer Vision and Time Series.

Links:

Other Projects

Classificação de Roupas com Deep Learning MNIST: http://bit.ly/3pQxIb7
Detecção de Pneumonia com Deep Learning: http://bit.ly/2ZOxRBl
Previsão de Demanda com Prophet - Time Series: http://bit.ly/2ZOz7EB
Streamlit Aplicado ao Dataset do UBER: http://bit.ly/3pTSHKi
Add-Watermark Project: http://bit.ly/3uxIDdu
Filter Selector with Streamlit and OpenCV: https://bit.ly/3yD62vT
Heart Attack Analysis and Prediction: https://bit.ly/3wy3FbZ
Análise de Dados do Airbnb: https://bit.ly/2Zet7Iq
Water Quality Classification: https://bit.ly/3vuE9Fu

rafaelgrecco / extract_table_from_pdf

Extract Tables from Pdf Files

Using the application

Author

Rafael Messias Grecco

Other Projects

About

Languages