FormVision is a node.js library for extracting data from scanned forms.
- Extract text, barcodes and checkboxes from images
- Specify expected region, type and validation for each field
- Supports incorporation of domain specific knowledge (wrong place, right data)
- Supports multiple transformations (half printed, half written, different offsets)
- Meant to cut red tape!
$ npm install fv
Install fv
, download that image and that schema. Now run the command-line interface:
- Print raw data extracted from image (without matching).
coffee bin/cli.coffee --remove-red --lang=deu m10-printed.png ```
- Print form data extracted from image using the specified schema (with matching).
coffee bin/cli.coffee --remove-red --lang=deu --schema=m10-schema.json m10-printed.png ```
Here are some quick links to help you get started:
Licensed under the incredibly permissive MIT License. Copyright © 2013-2014 Christoph Schulz. Dependencies may be licensed differently.