MaxineXiong / Scraping-Scanned-PDF-Docs-using-OCR-with-RPA

This repository contains automation solutions that efficiently extracts text from scanned PDF documents with consistent layouts. Utilizing Tesseract OCR engine, the UiPath RPA robot achieves nearly 90% accuracy, streamlining the process and significantly reducing manual workload.

Repository from Github https://github.comMaxineXiong/Scraping-Scanned-PDF-Docs-using-OCR-with-RPARepository from Github https://github.comMaxineXiong/Scraping-Scanned-PDF-Docs-using-OCR-with-RPA

MaxineXiong/Scraping-Scanned-PDF-Docs-using-OCR-with-RPA Stargazers