pytesseract-ocr

There are 1 repository under pytesseract-ocr topic.

NanoNets / ocr-python
OCR library to extract text & tables from PDF files and images. Convert any image or PDF to CSV / TXT / JSON / Searchable PDF.
ocr pdf-to-csv searchable-pdf tesseract extract-text-from-image extract-text-from-pdf image-to-text-converter pdf pytesseract-ocr python table-extract textract pdf-to-json pdf-to-text extract-table image-to-text
Language:Jupyter Notebook 63
asagar60 / TableNet-pytorch
Pytorch Implementation of TableNet
pytorch deep-learning tablenet ocr pytesseract-ocr
Language:Jupyter Notebook 57
shayanalibhatti / Designing-a-PDF-Audiobook-using-Python
In this code, a simple implementation of PDF to audio converter is shown
python python3 pdf-to-audio pdf-reader pytesseract pymupdf gtts audio-converter pdf-text pytesseract-ocr
Language:Python 41
Team-Cornflakes / VitaFile
Google Solution Challenge 2024. Team Cornflakes VIT Chennai
bert django gemini-pro gemini-pro-vision palm pytesseract-ocr react translate-api
Language:JavaScript 25
bhavita / Auto-Audio-Books
Convert pdf to audiobooks 📚
audiobooks pdf pytesseract-ocr google-text-to-speech pdf-to-audiobook
Language:Jupyter Notebook 21
lamnguyenkhoa / container-code-recognition
Detect and extract containers code in a video.
computer-vision darknet deep-learning object-detection opencv-python pycharm pytesseract-ocr recognition truck yolov4
Language:Python 21
radioactive11 / ALPR-India
Detect and scan the license plate number from vehicle images
cnn opencv pytesseract-ocr
Language:Python 19
icaropires / pdf2dataset
Converts a whole subdirectory with a big (or small) volume of PDF documents to a dataset (pandas DataFrame) with error tracking and choice of features
python3 ray distributed-systems distributed-computing parallel pdf data-science parquet tesseract-ocr tesseract ocr pytesseract pytesseract-ocr pdf2image pdftotext python pandas-dataframe pyarrow
Language:Python 17
amenezes / aiopytesseract
A Python asyncio wrapper for Tesseract-OCR.
ocr tesseract asyncio tesseract-ocr optical-character-recognition text-extraction pdftotext pytesseract pytesseract-ocr
Language:Python 16
ShyrenMore / cashflow-frontend
A PWA to make you aware of your spending habits
axios chartjs expense-tracker financial-data money-tracker nanonets-api ocr-text-reader opencv personal-finance pwa pytesseract-ocr react react-chartjs-2 react-hooks react-query react-router reactjs
Language:JavaScript 13
goldenryu2000 / Discord-OCR-Bot
This is an OCR Bot for Discord made using OpenCV and Pytesseract
ocr discord-bot ocr-discord-bot ocr-python ocr-recognition pytesseract-ocr pytesseract discord python python3 bot heroku heroku-deployment heroku-app ocr-bot opencv hacktoberfest
Language:Python 10
deepshig / Textual-Video-to-Speech-Interface
An interface to extract text from a video and convert it to speech
text-to-speech text-analysis video-processing image btech-project btech-project-proposal optical-character-recognition python mosaic-images python-mosaic mosaic image-binarization pytesseract-ocr pytesseract google-text-to-speech undergraduate-project computer-science-project
Language:Python 9
nayyhah / PDFAutomation-OCRTextRecognition
PDF Automation - OCR Text Recognition
dell-hack2hire-hackathon opencv-python remove-watermark image-alignment text-extraction text-extraction-from-image pytesseract-ocr pdf-to-image
Language:JavaScript 8
moebius-analitica / meetup-webscraping
Charla de web scraping sobre datos públicos de Chile
pytesseract-ocr python selenium-webdriver selenium-python beautifulsoup tabula-py
Language:Python 7
7410abhi / Image_detector-using-python-libraries
PROJECT(Image_detector)_using_python_Libraries
pil pillow python3 pytesseract pytesseract-ocr kraken opencv-python python-project python-programming python-libraries
Language:Python 6
bhattbhavesh91 / pytesseract-demo
A simple demo to show the power of PyTesseract: Simple Python Optical Character Recognition
optical-character-recognition ocr python python-tesseract pytesseract pytesseract-ocr demo
Language:Jupyter Notebook 6
SanjinKurelic / FlaskALPR
Flask ALPR is a web service for automatic license plate recognition (ALPR). The web service is written in Python using Flask for REST API and OpenCV with PyTesseract for plate recognition. The service offers two REST API-s, one for checking if licence plate is detected and one for detecting licence plate from camera image. All detected licence plates are automatically stored in a SQLite database using the SQLAlchemy library.
flask sqlalchemy python-dateutil opencv opencv-python imutils numpy pytesseract-ocr pytesseract tesseract-ocr sqlite3 tesseract python-ocr flask-sqlalchemy sqlite-python alpr
Language:Python 6
prathyyyyy / Medical-Data-Extraction
Medical Data Extraction By Pytesseract (Google Optical Character Recognition Engine) and Computer Vision
computer-vision fastapi pdf2image pytesseract pytesseract-ocr pytest python
Language:Jupyter Notebook 5
ScottStevenWhite / DocsInARow
"Docs in a Row" is an automated script designed to handle image data extraction, correction, categorization, and storage. It utilizes a variety of technologies including OpenAI, Google Cloud Vision, pytesseract, and PIL to extract and correct text from images, categorize the content, and store useful metadata.
openai openai-api pytesseract-ocr vision-api good-first-issue
Language:Python 5
Agasthya7 / VTU-Result_Scraper-CAPTCHA_Bypass
This is a set of python programs created to scrape the results of the students whose USN is provided to program which automatically solves the CAPTCHA and stores the result in a text file
cgpa-calculator excel-export keras-models pytesseract-ocr python3 vturesults webdriver
Language:Python 4
Crazy2code15 / Computer-Vision-and-IOT-Internship--TSF-Projects
Computer Vision and IOT projects (ML and DL techniques)
machine-learning opencv pandas matplotlib pyplot yolov3 pytesseract-ocr tessaract-ocr numpy argparse imutils random time
Language:Jupyter Notebook 4
crizbae / PictoPlan
PictoPlan streamlines lesson planning for educators. It automates converting visuals into educational content, saving time and ensuring quality teaching.
cloudflare css education fastapi flask heroku html openai-api pytesseract-ocr python
Language:Python 4
IshaanOhri / Capture
Capture is a python based desktop application that lets you capture the text which otherwise cannot be copied. It saves the time spent on software/website to get the text. Just select and voila, your text is copied to the clipboard.
ocr image-to-text python crop crop-image pytesseract pytesseract-ocr screenshot screencapture
Language:Python 4
MusadiqPasha / Sudoko-Solver
A Python script that automates the game of Sudoku and enters it automatically into the website.
pyautogui pytesseract-ocr python3 sudoku-game sudoku-solver
Language:Python 4
pavtiger / Parse-tables-from-PDF
A tool that automizes the process of pulling data tables from PDF documents where they are as scans
pdf python webserver opencv pytesseract pytesseract-ocr socketio
Language:Python 4
Prince2124 / Automatic-License-Number-Plate-Recognition-ANPR-
Automatic Number Plate Recognition (ANPR) is commonly used in many countries for many applications like Ticketless parking fee management, car theft prevention etc. ANPR systems consists of three stages: Number Plate detection, character segmentation and character recognition.
python image-processing image-recognition opencv pytesseract-ocr tesseract-ocr computer-vision
4
saccofrancesco / crosswords-solver
This is a script for getting the Aswer of a Crossword Puzzle, using pyTesseract and Web Scraping
python automation webscraping crossword-solver pytesseract-ocr image-processing textrecognition
Language:Python 4
Abhishekmishra-17 / Vehicle-plate-number-detection-using-python
plate-recognition plate-number plate-detection pytesseract-ocr python license-plate-recognition
Language:Python 3
ardraayyappath / smart-parking-system
The program detects license plate numbers, colour, entry time and exit time of the vehicle. The parking-lot management module keeps tracks of empty parking slots and details of all vehicles entering and leaving are stores in a csv file. Complete end-to-end parking system management is carrid out.
plate-number opencv pytesseract-ocr parking-management parking-lot
Language:Python 3
Jishnnu / InvoiceAI-Document-Parser
Simple Streamlit application that parses the data from Invoice images and returns it in JSON format
doctr imutils jina-chat keras-ocr kor langchain machine-learning matplotlib mindee numpy opencv-python pytesseract-ocr streamlit-webapp
Language:Jupyter Notebook 3
kishdubey / scheduley
take a screenshot of your course time table -> get back an ICS file to import into your digital calendar of choice
python flask pytesseract-ocr bootstrap ics calendar
Language:Python 3
Manav1918 / OCRApp
OCRApp is simple Image to Text converter GUI App using PyQt5, openCV and pytesseract
pytesseract-gui pytesseract pytesseract-ocr pyqt5-desktop-application python opencv opencv-python
Language:Python 3
MvMukesh / AutoKYC-ExtractionEngine
Named Entity Extraction with OpenCV, Pytesseract, Spacy (OCR + NER), BIO Labelling
datapreprocessing ner nlp opencv pytesseract spacy-nlp bio-tagging regular-expressions ocr-service computer-vision deep-learning flask-application pytesseract-ocr labelling bert-ner
3
Sabretooth1405 / pdf_reader
A site that uses ocr on pdfs and images to extract text.
django pdf2image pytesseract-ocr python pytorch
Language:Python 3
SpaceTesla / attendance-management-using-image-processing
Take a screenshot of your participants' list. Upload it. Select your excel sheet. That's all. Your attendance has been marked.
python tkinter-gui pytesseract-ocr openpyxl pillow
Language:Python 3
Tonumoy / OCR-on-Image-ROI-with-Tesseract
Applying OCR on manually selected Region of Interests (using mouse drag) for Text extraction from Images
image image-processing imagetotext ocr opencv pytesseract pytesseract-ocr python python3 spyder
Language:Python 3

pytesseract-ocr

NanoNets / ocr-python

asagar60 / TableNet-pytorch

shayanalibhatti / Designing-a-PDF-Audiobook-using-Python

Team-Cornflakes / VitaFile

bhavita / Auto-Audio-Books

lamnguyenkhoa / container-code-recognition

radioactive11 / ALPR-India

icaropires / pdf2dataset

amenezes / aiopytesseract

ShyrenMore / cashflow-frontend

goldenryu2000 / Discord-OCR-Bot

deepshig / Textual-Video-to-Speech-Interface

nayyhah / PDFAutomation-OCRTextRecognition

moebius-analitica / meetup-webscraping

7410abhi / Image_detector-using-python-libraries

bhattbhavesh91 / pytesseract-demo

SanjinKurelic / FlaskALPR

prathyyyyy / Medical-Data-Extraction

ScottStevenWhite / DocsInARow

Agasthya7 / VTU-Result_Scraper-CAPTCHA_Bypass

Crazy2code15 / Computer-Vision-and-IOT-Internship--TSF-Projects

crizbae / PictoPlan

IshaanOhri / Capture

MusadiqPasha / Sudoko-Solver

pavtiger / Parse-tables-from-PDF

Prince2124 / Automatic-License-Number-Plate-Recognition-ANPR-

saccofrancesco / crosswords-solver

Abhishekmishra-17 / Vehicle-plate-number-detection-using-python

ardraayyappath / smart-parking-system

Jishnnu / InvoiceAI-Document-Parser

kishdubey / scheduley

Manav1918 / OCRApp

MvMukesh / AutoKYC-ExtractionEngine

Sabretooth1405 / pdf_reader

SpaceTesla / attendance-management-using-image-processing

Tonumoy / OCR-on-Image-ROI-with-Tesseract