ocr ocr-android ocr-ios receipt-scanning textract barcode

vision-data

2D

Receipt scanning

Sample receipts from german grocery and hardware stores, drugstores, gas stations and other businesses. Receipts are cropped (subfolder A) and uncropped (B, C, D) 300dpi scans from a flatbed scanner. File names contain the major data points from the receipt (merchant, date of purchase, number of items, total amount due) so that they can also be used as simple ground truth cases for training receipt parsing or similar OCR applications. The naming scheme is "$MERCHANT_$DATE_$NUMITEMS_$AMOUNT.ext", e.g. "aldi_02032020_19_02423.jpeg".

For each scan, a JSON file is provided that contains the result of Amazon AWS Textract OCR for the respective receipt. The file contains the unaltered array of BLOCKS as returned from Textract. The naming of the JSON file follows the same scheme, only with an appended "blocks", e.g. "aldi_02032020_19_02423_blocks.json".

Extracting the lines of text from the json in two lines of Python:

content = json.load("aldi_02032020_19_02423_blocks.json")
lines = list(filter(lambda x : x['BlockType'] == 'LINE', content))

The lines list will then contain the individual line objects with text content, confidence, normalized position on the page etc.; raw JSON sample:

{"BlockType": "LINE", "Confidence": 98.87596130371094, "Text": "BIO HONIG", "Geometry": {"BoundingBox": {"Width": 0.20352864265441895, "Height": 0.015931174159049988, "Left": 0.01894291304051876, "Top": 0.2164158821105957}} ... }

Subset from our test cases for a receipt scanning app.

3D

None yet

About

Sample images and data for vision projects.

https://softmatic.com

ocr ocr-android ocr-ios receipt-scanning textract barcode