There are 1 repository under pytesseract-ocr topic.
OCR library to extract text & tables from PDF files and images. Convert any image or PDF to CSV / TXT / JSON / Searchable PDF.
Pytorch Implementation of TableNet
In this code, a simple implementation of PDF to audio converter is shown
Google Solution Challenge 2024. Team Cornflakes VIT Chennai
Convert pdf to audiobooks đź“š
Detect and extract containers code in a video.
Detect and scan the license plate number from vehicle images
Converts a whole subdirectory with a big (or small) volume of PDF documents to a dataset (pandas DataFrame) with error tracking and choice of features
A Python asyncio wrapper for Tesseract-OCR.
A PWA to make you aware of your spending habits
This is an OCR Bot for Discord made using OpenCV and Pytesseract
An interface to extract text from a video and convert it to speech
PDF Automation - OCR Text Recognition
Charla de web scraping sobre datos pĂşblicos de Chile
PROJECT(Image_detector)_using_python_Libraries
A simple demo to show the power of PyTesseract: Simple Python Optical Character Recognition
Flask ALPR is a web service for automatic license plate recognition (ALPR). The web service is written in Python using Flask for REST API and OpenCV with PyTesseract for plate recognition. The service offers two REST API-s, one for checking if licence plate is detected and one for detecting licence plate from camera image. All detected licence plates are automatically stored in a SQLite database using the SQLAlchemy library.
Medical Data Extraction By Pytesseract (Google Optical Character Recognition Engine) and Computer Vision
"Docs in a Row" is an automated script designed to handle image data extraction, correction, categorization, and storage. It utilizes a variety of technologies including OpenAI, Google Cloud Vision, pytesseract, and PIL to extract and correct text from images, categorize the content, and store useful metadata.
This is a set of python programs created to scrape the results of the students whose USN is provided to program which automatically solves the CAPTCHA and stores the result in a text file
Computer Vision and IOT projects (ML and DL techniques)
Capture is a python based desktop application that lets you capture the text which otherwise cannot be copied. It saves the time spent on software/website to get the text. Just select and voila, your text is copied to the clipboard.
A Python script that automates the game of Sudoku and enters it automatically into the website.
A tool that automizes the process of pulling data tables from PDF documents where they are as scans
Automatic Number Plate Recognition (ANPR) is commonly used in many countries for many applications like Ticketless parking fee management, car theft prevention etc. ANPR systems consists of three stages: Number Plate detection, character segmentation and character recognition.
This is a script for getting the Aswer of a Crossword Puzzle, using pyTesseract and Web Scraping
The program detects license plate numbers, colour, entry time and exit time of the vehicle. The parking-lot management module keeps tracks of empty parking slots and details of all vehicles entering and leaving are stores in a csv file. Complete end-to-end parking system management is carrid out.
Simple Streamlit application that parses the data from Invoice images and returns it in JSON format
Named Entity Extraction with OpenCV, Pytesseract, Spacy (OCR + NER), BIO Labelling
A site that uses ocr on pdfs and images to extract text.
Take a screenshot of your participants' list. Upload it. Select your excel sheet. That's all. Your attendance has been marked.
Applying OCR on manually selected Region of Interests (using mouse drag) for Text extraction from Images