asepmaulanaismail / pdf-to-txt-python

Simple PDF-to-text conversion with Python using PDFtk and PyPDF2

PDF to Text with Python

Introduction

This program will:

  1. Split your PDF into pages,
  2. Extract the text from each page, and
  3. Save it to a .txt file.
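
The three steps above could be sketched roughly as follows. This is a minimal illustration, not the repository's actual main.py. It assumes the `pdftk` command is on the PATH and that PyPDF2 2.0 or later is installed (older releases used `PdfFileReader` instead of `PdfReader`). The function names, the `pages` working directory, and the `page_%04d.pdf` naming pattern are hypothetical choices made for this example.

```python
import subprocess
import sys
from pathlib import Path


def burst_command(pdf_path, out_dir):
    """Build the PDFtk command that splits a PDF into one file per page."""
    # "page_%04d.pdf" is an illustrative naming pattern, not the repo's own.
    pattern = str(Path(out_dir) / "page_%04d.pdf")
    return ["pdftk", str(pdf_path), "burst", "output", pattern]


def extract_page_text(page_pdf):
    """Extract the text of a (single-page) PDF with PyPDF2."""
    # Imported lazily so the helpers above work without PyPDF2 installed.
    from PyPDF2 import PdfReader  # assumes PyPDF2 >= 2.0

    reader = PdfReader(str(page_pdf))
    # extract_text() may return an empty result for some pages.
    return "\n".join(page.extract_text() or "" for page in reader.pages)


def save_text(page_texts, txt_path):
    """Write the per-page texts to one .txt file, separated by blank lines."""
    Path(txt_path).write_text("\n\n".join(page_texts), encoding="utf-8")


def pdf_to_txt(pdf_path, work_dir="pages"):
    """Split, extract, and save; returns the path of the .txt file."""
    pdf_path = Path(pdf_path)
    work_dir = Path(work_dir)
    work_dir.mkdir(exist_ok=True)

    # Step 1: split the PDF into pages with PDFtk.
    subprocess.run(burst_command(pdf_path, work_dir), check=True)

    # Step 2: extract the text from each page file, in page order.
    page_files = sorted(work_dir.glob("page_*.pdf"))
    texts = [extract_page_text(page) for page in page_files]

    # Step 3: save the text next to the input file.
    txt_path = pdf_path.with_suffix(".txt")
    save_text(texts, txt_path)
    return txt_path


if __name__ == "__main__" and len(sys.argv) == 2:
    print(pdf_to_txt(sys.argv[1]))
```

Keeping the command construction and the file writing in small separate functions makes them easy to check without PDFtk or PyPDF2 installed; only `pdf_to_txt` needs both tools.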

Required

  - PDFtk
  - PyPDF2

Run

$ python main.py <your-pdf-file>

Why Use PDFtk?

Because PyPDF2's text extraction function doesn't work on some files.
