straussmaximilian / ocrmac

A Python wrapper to extract text from images on a Mac system. Uses the Vision framework from Apple.

Method to get x/y location of OCR recognized text?

hicaicai opened this issue · comments

Is there a way to obtain the coordinates of text within the entire screen?

You can get the pixel coordinates for the respective image.
I have now added functionality to return them directly with px=True.

Note that you might want to round the coordinates.
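
For anyone who wants to do the conversion by hand, here is a minimal sketch. It assumes each bounding box is a normalized (x, y, width, height) tuple with the origin in the bottom-left corner, which is how Apple's Vision framework reports bounding boxes. The function name bbox_to_pixels and the offset parameter are made up for illustration and are not part of ocrmac.

```python
def bbox_to_pixels(bbox, image_width, image_height, offset=(0, 0)):
    """Convert a normalized, bottom-left-origin box to rounded pixel corners.

    bbox is assumed to be (x, y, width, height) with values in 0..1.
    offset is a hypothetical (left, top) position of the image on the
    screen, useful when the image is a screenshot of only part of it.
    Returns (left, top, right, bottom) with a top-left origin.
    """
    x, y, w, h = bbox
    off_x, off_y = offset

    # Horizontal axis: same direction in both systems, so just scale.
    left = round(x * image_width) + off_x
    right = round((x + w) * image_width) + off_x

    # Vertical axis: flip, because image pixels count down from the top.
    top = round((1 - y - h) * image_height) + off_y
    bottom = round((1 - y) * image_height) + off_y

    return left, top, right, bottom


if __name__ == "__main__":
    # Made-up box on an 800x600 image.
    print(bbox_to_pixels((0.25, 0.5, 0.5, 0.25), 800, 600))
```

If the image is a full-screen screenshot, the result is already in screenshot pixel coordinates; for a screenshot of a region, pass that region's top-left corner as offset. With px=True the library should make this step unnecessary, so treat the sketch as a fallback.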
