Auger is a GUI OCR tool for extracting text from images.
Have a screenshot but need it as a text file? Then, Auger is the tool for you!
You can select multiple regions of text within an image and format the results yourself.
Auger offers you two ways of formatting your output within the program:
- HTML, with both a WYSIWYG and raw code view
- Text, with font and font size customizable
Any image format compatible with the Qt library is compatible with Auger.
Languages supported by your OCR backend (e.g.: Tesseract) are supported by Auger. Pick the language, select part of the image, and boom! It's that simple.
Auger supports output into the following formats:
- Plain Text
- HTML
Installing Auger is easy...
Auger is available in binary distributions for both Windows and Linux. You can get them here.
You can always just clone the repo and setup a virtual environment for the purpose of running Auger:
git clone https://github.com/m-flak/auger auger
cd auger
pip install -r requirements.txt
python auger.py
git clone https://github.com/m-flak/auger auger
cd auger
python setup.py build # YOU CAN USE ANY COMMAND SUPPORTED BY SETUPTOOLS
cd build/lib; python -m auger
- PyQt5
- Pillow
- pyocr
- lxml
- iso-language-codes