Peter Williams's repositories
pdf-search
Programs for searching PDF files.
Butt-Head-Astronomer
They laughed at Columbus, they laughed at Fulton, they laughed at the Wright Brothers. But they also laughed at Bozo the Clown.
CUTIE
CUTIE (TensorFlow implementation of Convolutional Universal Text Information Extractor)
datastore
Access Google DataStore
image-testing
Evaluation of imaging code
jbig2enc
JBIG2 Encoder
leptonica
Leptonica is an open source library containing software that is broadly useful for image processing and image analysis applications. The official github repository for Leptonica is: danbloomberg/leptonica. See leptonica.org for more documentation and recent releases.
Little-CMS
A free, open source, CMM engine. It provides fast transforms between ICC profiles.
ocropy
Python-based tools for document analysis and OCR
pdf-benchmarks
Benchmarks for evaluating PDF processing programs
pdf-corpora
Some PDF corpora
polyclip-go
Go library for Boolean operations on 2D polygons.
Representation-Learning-for-Information-Extraction
Pytorch implementation of Paper by Google Research - Representation Learning for Information Extraction from Form-like Documents.
ToneRanger
A program to detect the tone of documents.
unidoc-examples
Examples for UniDoc
unipdf
Golang PDF library for creating and processing PDF files (pure go)