There is 1 repository under the pypdf2 topic.
A pure-Python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
Benchmarking PDF libraries
Text extraction from multiple and large PDF documents.
A Python library/command-line tool to extract the DOI or other identifiers of a scientific paper from a PDF file.
Remove PDF watermarks from academic papers using pypdf
A cross-platform utility, written in Python, to join, split, stamp, and rotate PDFs. Yes, Python!
pdfgui_tools is a user interface tool developed in Qt and Python that integrates with poppler-utils and PyPDF2 for PDF document management. It's a simple and user-friendly tool that includes various utilities.
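A PDF processing tool, including: PDF cropping, PDF rotation, PDF merging, PDF splitting, adding page numbers to PDFs, PDF-to-image conversion, Word-to-PDF conversion, and more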
Batch-convert PDFs to text and extract data from PDFs in Python
This is a complete website in which you can chat with a PDF and extract metadata, text, links, images, and a lot more. Check my blog for more details: https://medium.com/@amit.2503719/allaboutpdf-tool-for-data-extraction-and-talking-to-pdf-using-chatpdf-feature-f2daea15a59c
A user-friendly application that allows users to upload PDF documents and receive concise summaries generated using advanced Large Language Models (LLMs).
Smart ATS evaluates resumes against job descriptions, providing match percentage, missing keywords, and improvement suggestions.
Prepare documents for distribution
A simple and offline PDF audio reader
:rocket:Parse PDFs, Word and Excel documents. Read, Create, Merge/Combine, Extract data from office documents.
A Flask app that removes the CamScanner watermark from scanned PDFs.
Simple PDF-to-text conversion with Python using PDFtk and PyPDF2
AgriGenius: AI-Powered Agriculture Chatbot is a Python web application designed to make information more accessible to farmers. AgriGenius leverages a Retrieval-Augmented Generation model to answer farmers' agricultural queries precisely.
A desktop application which helps students to choose Disciplinary and Open Electives wisely.
Simple Python GUI Tool for Tesseract4
Get local e-papers (Dainik Jagron and Hindustan)
The "MCQ Generator with Streamlit" web app utilizes OpenAI's language models to create multiple-choice questions (MCQs) from uploaded PDF or text files. Users can customize question parameters like quantity, subject, and tone. The app offers real-time complexity feedback and presents MCQs in an easy-to-read tabular format.
Convert a PDF to an MP3 file using this Python code
A script to convert MS Office PPT/PPTX files to PDF files and then merge all the PDF files into a single PDF file.
A Python script that converts the KCT (Kumaraguru College of Technology) academic calendar PDF file into a CSV file and syncs the events with Google Calendar.
A script that generates a PDF file. You can create a new PDF file from an HTML file or write on top of an existing PDF.
Pydf - Pyrogram Document File Bot, a modular Telegram bot that provides PDF tools. Works using PyPDF2.
Simple Python utilities to play around with PDF files
RAG chatbot using Llama 2, chainlit and Faiss
Provides a comprehensive solution for detecting plagiarism and finding similarities between text documents
This repository contains some beginner-level Python projects.
Resume Analyzer
A Python-based tool that converts PDF files into editable Word documents, preserving text, images, and layout. Uses PyPDF2, PyMuPDF (fitz), python-docx, and Pillow to accurately transfer content from PDF to .docx. Ideal for transforming complex PDFs into Word format for easy editing.