volmaster / scrapy_tesseract

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

scrapy_tesseract

This spider uses Scrapy-Splash to render the page and simulates click on the button to show a phone number. This number placed as the image that encoded via base64 and we need to decode that image and use optical character recognition library Tesseract to convert integers from image to a string.

Demo of spider work: https://youtu.be/51xiw_PVolA

About


Languages

Language:Python 100.0%