barzik / pdftotextjs

pdftohtml nodejs wrapper

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

pdftotextjs - pdftotext shell wrapper for Node.js

view on npm npm module downloads Dependency Status Build Status view on npm

pdftotextjs provides access to pdftotext via shell in nodejs. You'll need pdftotext which comes with poppler-utils.

  • Ubuntu/Debian sudo apt-get install poppler-utils.
  • MacOSX sudo port install poppler or brew install xpdf.
  • Windows download and install Xpdf.

Installation

via npm:

$ npm install pdftotextjs

Usage

Asynchronous example

var pdftotext = require('pdftotextjs'),
    pdf = new pdftotext('test/pdfs/sample.pdf');

// Convert first page only
// These options will be passed to pdftotext
// You may use any valid option
pdf.add_options(['-f 1', '-l 1']);

pdf.getText(function(err, stdout, stderr) {
  if (err) {
    console.error(stderr);
  else {
    console.log(stdout);
  }
});

Synchronous example

var pdftotext = require('pdftotextjs'),
    pdf = new pdftotext('test/pdfs/sample.pdf');

var data = pdf.getTextSync(); // returns buffer
console.log(data.toString('utf8'));

Tests

$ npm test

Coverage

$ npm test-cov

About

pdftohtml nodejs wrapper

License:MIT License


Languages

Language:JavaScript 92.8%Language:Makefile 7.2%