GonzaloRizzo / pdf-text-parser

Parses PDF's text in a nice and simple format

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

PDF Text Parser

Version Downloads Downloads/week License

Parses PDF's text in a nice and simple format

Usage

const fs = require('fs')
const pdfParser = require('pdf-text-parser');

const pdfData = fs.readFileSync('file.pdf');
pdfParser(pdfData)
  .then(parsedText => console.log(parsedText));

CLI Usage

$ pdf-text-parser file.pdf

Output Format

{
   "metadata":{
      "Title":"Example Output"
   },
   "pages":[
      [
         "Fist Page"
      ],
      [
         "Second Page"
      ]
   ]
}

About

Parses PDF's text in a nice and simple format


Languages

Language:JavaScript 100.0%