bitextor / pdf-extract

PDF parser and converter to HTML

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

common.print use

lpla opened this issue · comments

Please, don't use common.print even for log and error outputs. This messes stdout when processing PDFs with the Stream overload Extract method.

Also, please make them optional, as I don't want to fill stderr or log files with the default "File: Input Stream, Start extract" and "File: Input Stream, Extract success." messages when processing thousand of PDFs.

The most annoying prints are those in PDFExtract/src/Main.java and PDFExtract/src/pdfextract/PDFExtract.java

I silenced all stdout prints from PDFExtract.jar in https://github.com/bitextor/python-pdfextract until this is fixed. Implemented in version 1.0.1.2 of the wrapper (https://github.com/bitextor/python-pdfextract)