felixlu07 / ocronpdf

This code extracts text from a PDF file using OCR, cleans it, and writes it to an Excel spreadsheet. It uses fitz, io, ocrmypdf, and pandas libraries to achieve this task.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

felixlu07/ocronpdf Watchers