Mustufain / pypdf2xml

Convert text from PDF to XML.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

pypdf2xml

This project started as an alternative to poppler's pdftoxml, which didn't properly decode CID Type2 fonts in PDFs. This script requires pdfminer.

License

Public domain.

About

Convert text from PDF to XML.


Languages

Language:Python 100.0%