pypdf2

There are 1 repository under pypdf2 topic.

py-pdf / pypdf
A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
pypdf2 pdf python pdf-parser pdf-parsing pdf-manipulation pdf-documents help-wanted
Language:Python 9569
pikepdf / pikepdf
A Python library for reading and writing PDF, powered by QPDF
pdf pdf-generation pdf-manipulation existing-pdfs pypdf2 python pikepdf qpdf
Language:Python 2514
py-pdf / benchmarks
Benchmarking PDF libraries
benchmark data-extraction mupdf pdf poppler-utils pypdf2 text-extraction
Language:Python 315
PDFs-TextExtract
ahmedkhemiri95 / PDFs-TextExtract
Multiple and Large PDF Documents Text Extraction.
pdf parser data-science python pdf-processing extract-text text-analytics pdfs-textextract pdf-document pypdf2 pdfs pdfminer
Language:Python 131
MicheleCotrufo / pdf2doi
A python library/command-line tool to extract the DOI or other identifiers of a scientific paper from a pdf file.
doi python pdf bibtex arxiv identifiers arxiv-identifiers bibtex-entry extract-doi extract metadata pdf-text pypdf2
Language:Python 128
chazeon / PDF-Watermark-Remover
Remove PDF watermarks from academic papers using pypdf
pdf pdf-watermark pdf-manipulation watermark-remover pypdf2 academic-papers
Language:Python 56
doolieSoft / PdfPasswordRemover
Tool made to remove password in pdf files
pdf python pypdf2
Language:Python 50
gaborvecsei / Pdf-Split-Merge
simple pdf file split and merge tool
pdf python3 python pdf-document pypdf2
Language:Python 43
py-pdf / PyPDF-Builder
A cross-platform utility to join, split, stamp, and rotate PDFs written in Python. Yes, Python!
pypdf2 python gui tkinter front-end
Language:Python 39
TheWatcherMultiversal / pdfgui_tools
pdfgui_tools is a user interface tool developed in Qt and Python that integrates with poppler-utils and PyPDF2 for PDF document management. It's a simple and user-friendly tool that includes various utilities.
pdf pdf-document poppler-utils pypdf2 python3 gnu-linux linux pymupdf pyside6 qt6
Language:Python 37
jiandandaoxingfu / pdfdo
pdf文件处理工具, 包含: pdf剪切, pdf旋转, pdf合并, pdf拆分, pdf添加页码, pdf转图片, word转pdf等功能
python pypdf2 pdfdo pdf pdf2img word2pdf doc2pdf pdf-cut pdf-merge pdf-split pdf-number
Language:Python 34
shine-jayakumar / Extract-Data-From-PDF-In-Python
Batch-convert pdf to text, extract data from pdf in python
pdf-converter pdf-to-text pdf-tools pdf-parser python-pdf pypdf2 pypdf data-extraction regular-expressions pdf-reader batch-converter batch-conversion data-cleaning pdf-to-excel pdf-data-extraction pandas indirectobject xpdf pdftotext python-automation
Language:Python 32
amitgupta4407 / All_About_PDF
This is a complete website in which you can chat with pdf, extract meta data, text, links, image, and lot more . Check my blog for more details: https://medium.com/@amit.2503719/allaboutpdf-tool-for-data-extraction-and-talking-to-pdf-using-chatpdf-feature-f2daea15a59c
python langchain chatpdf gpt pypdf2 streamlit
Language:Python 30
santiago9631 / PDF-summarizer-chatbot-using-LLaMa2
A user-friendly application that allows users to upload PDF documents and receive concise summaries generated using advanced Large Language Models (LLMs).
llama2 nlp pdf-summarizer pypdf2 python streamlit
Language:Python 30
Deba951 / Resume-ATS-Tracking-LLM-Project
Smart ATS evaluates resumes against job descriptions, providing match percentage, missing keywords, and improvement suggestions.
gemini-api generative-ai llm pypdf2 streamlit
Language:Python 27
sfneal / pdfconduit
Prepare documents for distribution
pdf pdf-generation pdf-document-processor pdfkit python pypdf2 pdfrw encryption watermark
Language:Python 26
nikhilkumarsingh / PDF_AUDIO_READER
A simple and offline PDF audio reader
pdf-audio-reader pdf audio simple offline python pyttsx3 pypdf2
Language:Python 23
crispyzingy / PDFExcelWordParser
:rocket:Parse PDFs, Word and Excel documents. Read, Create, Merge/Combine, Extract data from office documents.
python openpyxl pypdf2 python-docx automation excel pdf word docs data-extraction data office
Language:Python 22
viveksb007 / camscanner_watermark_remover
This Flask App would remove CamScanner watermark from scanned pdfs.
flask pdf pypdf2 watermark
Language:Python 22
asepmaulanaismail / pdf-to-txt-python
Simple pdf to text with python using PDFtk and PyPDF2
python python3 pdf pdftk pypdf2 text-extraction pdf-extractor pdf-to-text
Language:Python 21
jayeshbhandarkar / AgriGenius
AgriGenius: AI-Powered Agriculture Chatbot is a Python web application designed to empower farmers with information accessibility. AgriGenius leverages a Retrieval-Augmented Generation model to address farmer's agricultural queries with precise answers.
artificial-intelligence chatbot retrieval-augmented-generation flask python chroma langchain pypdf2 requests together-ai api css html javascript meta-llama embeddings sentence-transformers agriculture gen-ai vector-store
Language:JavaScript 16
PranjalGupta2199 / OFFLINE-ERP
A desktop application which helps students to choose Disciplinary and Open Electives wisely.
pygobject timetable python3-7 python3 pandas reportlab pypdf2 pdf
Language:Python 16
Parathantl / tesseract_gui
Simple Python GUI Tool for Tesseract4
tesseract-4 tesseract-ocr gui python pyqt5 pysimplegui ghostscript pypdf2
Language:Python 15
gugli28 / LocalEPaper
get local e paper ( Dainik Jagron and Hindustan )
qpython python webscraping cron automation bs4 pypdf2 urllib watchdog tqdm tor selenium-webdriver compression ubuntu smtp
Language:Python 14
Azazel0203 / MCQ_GENERATOR
The "MCQ Generator with Streamlit" web app utilizes OpenAI's language models to create multiple-choice questions (MCQs) from uploaded PDF or text files. Users can customize question parameters like quantity, subject, and tone. The app offers real-time complexity feedback and presents MCQs in an easy-to-read tabular format.
json langchain llm openai pypdf2 streamlit
Language:Python 11
IAmMaulik / Audiobook_Maker
You can convert from a PDF to MP3 file using this python code
python pdf audio audiobook hacktoberfest pypdf2 automatic pdfreader audiobook-maker amazon audible automation reading gtts google
Language:Python 11
lukefire5156 / PPTs_TO_PDFs_AND_Merger
A script to convert MS Office PPT/PPTX files to PDF files and then merge all the PDF files to a single PDF file.
ppts-to-pdf pdf-merger ppt-merger ppts-to-pdf-merger ppts ppt pptx python python-script merge-study-material pdf-merge merger merge-pdf merge-ppts-to-pdf pypdf2 pypdf2-lib pypdf comptypes-lib
Language:Python 11
ajinux / KCT-Academic-Calendar-Converter
A python script to convert the KCT's(Kumaraguru college of technology) academic calendar pdf file into a csv file and will sync the events with google calendar.
python scripting googleapi pypdf2 csv kumaraguru-college-of-technology
Language:Python 10
nigelreign / pdf-generator
A script that generates a pdf file. You can create a new pdf file from an html file or you can write on top of an already existing pdf
pdf-generation html python pdf reportlab pypdf2 pypdf2-lib pdfkit
Language:HTML 10
pyDF-Bot
nuhmanpk / pyDF-Bot
Pydf - Pyrogram Document File Bot, a modular Telegram Bot which provides Pdf Tools Works using Pypdf2
pyrogram pyrogram-bot telegram-bot telegram pdf pypdf2 pypdf pypdf2-lib tools bot
Language:Python 9
r96ahularya / PDF-Player
Simple python utilities to play around with PDF Files
pdf-player pdf-files pdf player pdf-viewer pdf-converter pdf-document-processor python python3 pypdf2 python-utilities pdf-document pdfkit pdf-manipulation pdf-merge pdfmerger mergepdf merge rotate rotatepdf
Language:Python 9
ZeusSama0001 / RAG-chatbot
RAG chatbot using Llama 2, chainlit and Faiss
chainlit faiss huggingface langchain llama2 llama2-7b pypdf2
Language:Python 9
Karthik-02 / plagiarism-detection
Provides a comprehensive solution for detecting plagiarism and finding similarities between text documents
cosine-similarity datavisualization docx2txt nltk-python pandas plagiarism plagiarism-checker plagiarism-detection plagiarism-detector plotly-express pypdf2 tokenization webscrapping-python
Language:Python 8
triposat / Python_Beginner_Level_Projects
This Repository consists of some Python Beginner Level Projects.
pyzbar pypdf2 cv2 colour module python3 projects readme basics-of-python mouse
Language:Python 8
ARUNAGIRINATHAN-K / Resume_Analyzer
Resume Analyzer
analyzer css html js pypdf2 python resume scikitlearn-machine-learning spacy sqlite nlp nltk-python resume-builder resume-analyzer resume-matching resume-parser
Language:Python 7
kezb90 / PDF_To_Word
A Python-based tool that converts PDF files into editable Word documents, preserving text, images, and layout. Uses PyPDF2, PyMuPDF (fitz), python-docx, and Pillow to accurately transfer content from PDF to .docx. Ideal for transforming complex PDFs into Word format for easy editing.
automation image-extraction pdf-conversion pdf-to-docx pdf-to-word pymupdf pypdf2 python python-docx python-script text-extraction
Language:Python 7

pypdf2

py-pdf / pypdf

pikepdf / pikepdf

py-pdf / benchmarks

ahmedkhemiri95 / PDFs-TextExtract

MicheleCotrufo / pdf2doi

chazeon / PDF-Watermark-Remover

doolieSoft / PdfPasswordRemover

gaborvecsei / Pdf-Split-Merge

py-pdf / PyPDF-Builder

TheWatcherMultiversal / pdfgui_tools

jiandandaoxingfu / pdfdo

shine-jayakumar / Extract-Data-From-PDF-In-Python

amitgupta4407 / All_About_PDF

santiago9631 / PDF-summarizer-chatbot-using-LLaMa2

Deba951 / Resume-ATS-Tracking-LLM-Project

sfneal / pdfconduit

nikhilkumarsingh / PDF_AUDIO_READER

crispyzingy / PDFExcelWordParser

viveksb007 / camscanner_watermark_remover

asepmaulanaismail / pdf-to-txt-python

jayeshbhandarkar / AgriGenius

PranjalGupta2199 / OFFLINE-ERP

Parathantl / tesseract_gui

gugli28 / LocalEPaper

Azazel0203 / MCQ_GENERATOR

IAmMaulik / Audiobook_Maker

lukefire5156 / PPTs_TO_PDFs_AND_Merger

ajinux / KCT-Academic-Calendar-Converter

nigelreign / pdf-generator

nuhmanpk / pyDF-Bot

r96ahularya / PDF-Player

ZeusSama0001 / RAG-chatbot

Karthik-02 / plagiarism-detection

triposat / Python_Beginner_Level_Projects

ARUNAGIRINATHAN-K / Resume_Analyzer

kezb90 / PDF_To_Word