jeturgavli / ImageToText

Screenshot Image to Text Data Extract into excel File

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Text Extraction and Excel Parsing from Images

This Python script extracts text from images using Tesseract OCR and organizes it into an Excel file.

Features

  • Automated Installation: Checks for required Python modules (pytesseract, openpyxl, pandas) and installs them if missing.
  • Text Extraction: Utilizes Tesseract OCR to extract text from images.
  • Data Parsing: Parses extracted text to extract contact names and times seen, organizing them into an Excel file.
  • Logging: Logs informative messages, warnings, and errors for better tracking and debugging.
  • User Interaction: Prompts the user for image and output folder paths, allowing for interactive usage.

Usage

  1. Ensure Python is installed.
  2. Install Tesseract OCR:
  3. Clone or download the repository.
  4. Place images to be processed in the images folder.
  5. Run the script (main.py).
  6. Follow the prompts to input image and output folder paths.
  7. View the generated Excel files in the output folder.

Dependencies

  • Python 3.x
  • Tesseract OCR
  • Required Python modules: pytesseract, openpyxl, pandas

Author

LAKSHMI

Contribution

JETUR GAVLI

License

This project is licensed under the MIT License.

About

Screenshot Image to Text Data Extract into excel File

License:MIT License


Languages

Language:Python 100.0%