Internet Archive's repositories
analyze_ocr
Parse OCR result files for pagenos, tables of contents, etc.
read_api_extras
Demo code for the Open Library Read API
kohacon2011-presentation
Presentation for KohaCon 2011
corrections
web app for distributed proofreading of derived documents
fifthelephant2012-presentation
How Internet Archive Preserves Petabytes of Data
maxmind-geoip
Source for MaxMind's GeoIP-Python to install via pip
poppler-ia
Changes to poppler to get accurate coordinates from pdfs