fonckchain / pdf-text-converter

Python tool for converting PDF files to text. Simplify your document processing tasks.

PDF to Text Conversion

This project contains a Python script that converts PDF files to text files.

Description

The script scans specified directories for PDF files. For each PDF file, it extracts the text content and saves it as a new text file in a specified output directory.
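The scan-and-convert flow described above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: the function names `find_pdfs` and `convert_all` are hypothetical, and it assumes text extraction is done with `extract_text` from `pdfminer.high_level`, which pdfminer.six provides.

```python
from pathlib import Path


def find_pdfs(directory):
    """Return the PDF files directly inside `directory`, sorted by path."""
    return sorted(
        p for p in Path(directory).iterdir()
        if p.is_file() and p.suffix.lower() == ".pdf"
    )


def convert_all(directories, output_dir, extract=None):
    """Write one .txt file per PDF found in `directories` into `output_dir`.

    `extract` maps a PDF path to its text. It defaults to pdfminer.six's
    high-level extract_text function (an assumption about the real script).
    """
    if extract is None:
        # Imported lazily so find_pdfs works even without pdfminer.six.
        from pdfminer.high_level import extract_text as extract

    out = Path(output_dir)
    out.mkdir(parents=True, exist_ok=True)  # create the output directory if needed

    written = []
    for directory in directories:
        for pdf_path in find_pdfs(directory):
            text = extract(str(pdf_path))
            txt_path = out / (pdf_path.stem + ".txt")
            txt_path.write_text(text, encoding="utf-8")
            written.append(txt_path)
    return written
```

In this sketch, two PDFs with the same file name in different input directories would overwrite each other's output, and subdirectories are not searched. Whether the real script handles either case is not stated here.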

🚀 Getting Started

πŸ” Dependencies

  • Python 3.7 or higher
  • pdfminer.six

📦 Installation

  1. Clone the repository to your local machine.
  2. Install the required Python package: pip install pdfminer.six

πŸ› οΈ Using the Script

  1. Open the Python script (pdf_to_text.py) in a text editor.
  2. Modify the directories list with the paths to the directories containing your PDF files.
  3. Modify the output_dir variable with the path to the directory where you want to save the text files.
  4. Run the script: python pdf_to_text.py
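For steps 2 and 3, the edited variables might look something like the sketch below. The names `directories` and `output_dir` come from the steps above; the paths are placeholder values, and `check_config` is a hypothetical helper (not part of the script) that catches a mistyped path before a long conversion run.

```python
import sys
from pathlib import Path

# Placeholder values for illustration; replace them with your own paths.
directories = ["./pdfs/reports", "./pdfs/invoices"]
output_dir = "./text_output"


def check_config(directories, output_dir):
    """Return a list of human-readable problems; an empty list means OK."""
    problems = []
    if sys.version_info < (3, 7):
        problems.append("Python 3.7 or higher is required")
    for d in directories:
        if not Path(d).is_dir():
            problems.append(f"input directory not found: {d}")
    out = Path(output_dir)
    if out.exists() and not out.is_dir():
        problems.append(f"output path is not a directory: {output_dir}")
    return problems


if __name__ == "__main__":
    for problem in check_config(directories, output_dir):
        print("config problem:", problem)
```

A missing output directory is not reported as a problem here, on the assumption that it can be created at run time.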

📄 License

This project is licensed under the MIT License.
