There are 1 repository under pdfplumber topic.
Convert PDF into an audiobook.
NLP model for extracting chinese datas from the documents
This project facilitates the extraction of text from PDF files using various Python libraries. It is designed to be flexible, allowing the choice among different text extraction libraries and supporting both single PDF file and directory containing multiple PDF files.
Extracting details from Resume(CVs) and matching with Job Description(JDs) using pretrained model like DistilBERT and ranking them using cosine similarity.
This is my exploration of a variety of Python 🐍 libraries. I have built geospatial data analytics systems from CSV files, Image and video processing tools like face detection and motion detection. I also built a website with flask (and three.js), I built apps connecting to several types of databases. Created a simple budgeting app that reads, writes and updates .txt files. I also created a simple graphic user interface for Mac.
Interface developed to extract information from web through scraping and summarize given data.
Business objective- The document classification solution should significantly reduce the manual human effort in the HRM. It should achieve a higher level of accuracy and automation with minimal human intervention
Программа парсит несколько pdf-отчетов, ищет необходимую информацию о серии и флаконах, формирует отчет и создает excel-файл с отчетом.
collecting data from the Barcelona City Hall Open Data Service's on socioeconomic indicators of the territorial division of the city of Barcelona
Scrapes data tables from a PDF file.
PDF to MP3, audio book convertor
A sample script to extract text data from a pdf file, converts it to a pandas data frame, and saves it into a CSV file.
Scrapes hazardous waste data from a website and PDF file for PACT Act. Cleans the data to prepare it for mapping.
Flask app that creates reecognizes overude items through parsing a an automated PDF from the checkout system, and generates emails for late users to bring the items back
Organizar extratos de noticias de arquivos pdf
GUI app for parsing specific PDF files (data from standardized Vehicle Registration smart card - Republic of Serbia) and generating data file for specific use case.
Scrapes hazardous waste data from a website and PDF file. Cleans and analyzes the data. Prepares the data for mapping.
This repository contains a Python script for comparing PDF files between a local source folder and a remote server. The script logs results, highlighting identical and non-identical files based on size and page count. It employs "pdfplumber" for PDF handling and "paramiko" for SSH connections.
Business objective- The document classification solution should significantly reduce the manual human effort in the HRM. It should achieve a higher level of accuracy and automation with minimal human intervention
A Python script to extract and parse tax details from a Form 16 PDF file using pdfplumber and regular expressions.
web app to query Cosenza bus timetable
This is a resume screening web app. The user can upload a resume in PDF format. The app will go through all the text in the resume and find the best possible job role for the user among a set of roles.
This project was done under the continued supervision and mentorship offered by the program ”Carreras Con Impacto” in order to be completed in a span of 12 weeks.
A Flask application that extracts invoice details from uploaded PDFs and images using LLM inference API
This project is an application that processes attached PDF documents containing flight information and extracts relevant data. The data is stored in a PostgreSQL database and visualized on a dynamic dashboard using Streamlit.
The goal of this project was to analyze timesheet data from various programs at Sault College to determine the peak hours when the most programs have classes on campus simultaneously.
This script parses Canadian Tire Financial Services (ctfs.com) PDF statement files and extracts the relevant transaction information which is then categorized and appended to an Excel file
🔴This is my work that was given to me by the recruiter.
Chat/Query with your pdf, txt, csv, docs files, also from links of blogs.