18F / doc_processing_toolkit

Python library to extract text from PDF, and default to OCR when text extraction fails.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

18F/doc_processing_toolkit Issues