spatie / pdf-to-text

Extract text from a pdf

Home Page: https://freek.dev/348-a-package-to-extract-text-from-a-pdf


Different results in a Windows Local and Linux Prod

CasonWebDev opened this issue

Hi, I'm trying to read the same file on my CentOS VPN production server that I read in my local Windows development environment, but the results are different. Does anyone know how to resolve this?

[image: output on CentOS production]

[image: output on Windows local]

This package uses pdftotext, so you may want to try the different options as listed in the manual to see if you get the desired results.

Perhaps the Windows port has some different defaults.

Try the layout and/or eol option.

The README includes usage instructions on how to pass options to pdftotext.
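To make the suggestion above concrete, here is a minimal sketch of how the `layout` and `eol` options end up on the pdftotext command line. It is written in Python rather than with the package itself, and the helper names (`build_pdftotext_command`, `extract_text`) are hypothetical. It assumes a `pdftotext` binary on the PATH and that options are given without the leading dash, as in the README.

```python
import subprocess


def build_pdftotext_command(pdf_path, options=(), binary="pdftotext"):
    """Build the argv list for pdftotext (hypothetical helper).

    Options are written without the leading dash, e.g. "layout" or "eol unix".
    The trailing "-" asks pdftotext to write the text to stdout.
    """
    cmd = [binary]
    for opt in options:
        parts = opt.split()
        if not parts:
            continue  # skip empty option strings
        cmd.append("-" + parts[0])  # the flag itself, e.g. -layout
        cmd.extend(parts[1:])       # its value, if any, e.g. unix
    cmd.extend([pdf_path, "-"])
    return cmd


def extract_text(pdf_path, options=(), binary="pdftotext"):
    """Run pdftotext and return the extracted text (needs the binary installed)."""
    result = subprocess.run(
        build_pdftotext_command(pdf_path, options, binary),
        capture_output=True,
        check=True,
    )
    return result.stdout.decode("utf-8", errors="replace")
```

Calling `extract_text("book.pdf", ["layout", "eol unix"])` on both machines is one way to rule out differing defaults before suspecting the binary itself.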

OK, I solved the issue: the pdftotext I had installed was a different one. I followed the README, but the pdftotext in poppler-utils is not the pdftotext from the Xpdf library. That was the problem.
Thanks for the help.
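For anyone wanting to check which pdftotext a machine has, a rough sketch follows. It assumes that `pdftotext -v` prints a version banner, that the Poppler build mentions "Poppler" in it, and that the Xpdf build mentions "xpdf" or "Glyph & Cog". Those banner strings are an assumption and may vary between versions, and the function names are hypothetical.

```python
import subprocess


def classify_banner(banner):
    """Guess the pdftotext flavour from its version banner (heuristic)."""
    text = banner.lower()
    # Check Poppler first: its banner may also credit Glyph & Cog.
    if "poppler" in text:
        return "poppler"
    if "xpdf" in text or "glyph & cog" in text:
        return "xpdf"
    return "unknown"


def detect_pdftotext(binary="pdftotext"):
    """Run `pdftotext -v` and classify whatever it prints."""
    result = subprocess.run([binary, "-v"], capture_output=True, text=True)
    # The banner may go to stderr, so look at both streams.
    return classify_banner(result.stdout + result.stderr)
```

Running `detect_pdftotext()` in development and in production should return the same value if both environments use the same implementation.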

@CasonWebDev I have the same issue. How did you solve it? I am using pdftotext.exe from Xpdf on Windows and pdftotext on Linux, but the results differ between them.

@roomoraaes I'm not using the .exe file. I'm only using the .bin file from Xpdf, and I'm running the application in a Linux environment. In my development environment I'm using the Ubuntu bash on Windows, so I get the same result in production and in development.