nilyang / docker-pdf2htmlex

Run pdf2htmlEX in a Docker container.

pdf2htmlEX in Docker

A Python web service that runs pdf2htmlEX on a PDF fetched from a URL, based on the pdf2htmlEX Debian package.

Usage:

$ docker run -ti -p 5000:5000 ukwa/pdf2htmlex

This runs an instance on port 5000 with a terminal attached, so you can press Ctrl-C to stop it.

Once running, you can use it as follows. To convert a whole PDF:

$ curl 'http://localhost:5000/convert?url=http://stlab.adobe.com/wiki/images/d/d3/Test.pdf'

There are also two optional parameters: first_page (defaults to 1) and last_page.
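
To show how the page-range parameters fit into a request, here is a minimal Python sketch of a client. The helper names build_convert_url and convert are hypothetical and not part of this project. The sketch assumes the service is reachable on localhost:5000 as in the docker command above, that it accepts a percent-encoded url parameter, and that it returns the converted HTML as UTF-8 text.

```python
from urllib.parse import urlencode
from urllib.request import urlopen


def build_convert_url(pdf_url, first_page=None, last_page=None,
                      base="http://localhost:5000/convert"):
    """Build a /convert request URL (hypothetical helper).

    The PDF URL is percent-encoded so that any '?' or '&' it contains
    is not mistaken for part of the service's own query string.
    """
    params = {"url": pdf_url}
    # Only send the optional parameters when the caller sets them,
    # so the service falls back to its own defaults otherwise.
    if first_page is not None:
        params["first_page"] = first_page
    if last_page is not None:
        params["last_page"] = last_page
    return base + "?" + urlencode(params)


def convert(pdf_url, **page_range):
    """Fetch the converted document.

    Assumption: the response body is HTML encoded as UTF-8.
    """
    with urlopen(build_convert_url(pdf_url, **page_range)) as response:
        return response.read().decode("utf-8")
```

With the container running, convert("http://stlab.adobe.com/wiki/images/d/d3/Test.pdf", first_page=1, last_page=2) would request only the first two pages.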

TODO

  • Consider switching to nigit to reduce the amount of code we have to maintain.

Change Log

  • 1.0.1:
    • Switch to gunicorn following instability in production.

About

License: Apache License 2.0


Languages

Python 84.1%, Dockerfile 15.9%