AdamOlszewskiIT / pdf2html-http-parser

This tool converts PDF files to HTML over HTTP, using a gevent WSGI server.

PDF2HTML Service

Converts PDF files to HTML in response to HTTP requests handled by a gevent server.
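To make the request flow concrete, here is a minimal sketch of what a WSGI app with a /parse endpoint could look like. It is not the repository's code: `convert_pdf_to_html`, `application`, and `serve` are hypothetical names, the converter is a stub, and the raw request body is treated as the upload (multipart parsing is left out for brevity). Only `gevent.pywsgi.WSGIServer` is a real gevent API.

```python
# Sketch only: names below are illustrative, not taken from the repository.

def convert_pdf_to_html(pdf_bytes: bytes) -> bytes:
    """Hypothetical stub; a real service would run a PDF-to-HTML converter here."""
    return b"<html><body>received %d bytes</body></html>" % len(pdf_bytes)


def application(environ, start_response):
    """WSGI entry point: accept POST /parse and reject everything else."""
    if environ.get("PATH_INFO") != "/parse":
        start_response("404 Not Found", [("Content-Type", "text/plain")])
        return [b"not found\n"]
    if environ.get("REQUEST_METHOD") != "POST":
        start_response(
            "405 Method Not Allowed",
            [("Content-Type", "text/plain"), ("Allow", "POST")],
        )
        return [b"use POST\n"]

    # Assumption: the whole request body is the upload (no multipart parsing).
    length = int(environ.get("CONTENT_LENGTH") or 0)
    body = environ["wsgi.input"].read(length)

    html = convert_pdf_to_html(body)
    start_response(
        "200 OK",
        [
            ("Content-Type", "text/html; charset=utf-8"),
            ("Content-Length", str(len(html))),
        ],
    )
    return [html]


def serve(host="0.0.0.0", port=9088):
    """Serve the app with gevent; the import is local so the sketch loads without gevent."""
    from gevent.pywsgi import WSGIServer

    WSGIServer((host, port), application).serve_forever()
```

Calling `serve()` would start listening on port 9088, the port used in the Docker command of this README. Because `application` is a plain WSGI callable, it can also be exercised directly with a hand-built `environ` dictionary.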

Setup

  1. Set up a virtual environment
python3 -m venv pdf2html
  2. Install dependencies
pip install -r requirements.txt

Run it

  1. Run the server
python pdf_2_html_service
  2. Build the Docker image
docker build -t pdf2html .
  3. Run the Docker container
docker run -itd -p 9088:9088 pdf2html
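After starting the server or container, it can be handy to confirm that something is accepting connections on port 9088 before sending a PDF. The helper below is a small sketch using only the standard library; `wait_for_port` is a hypothetical name and not part of this project.

```python
import socket
import time


def wait_for_port(host: str, port: int, timeout: float = 10.0) -> bool:
    """Return True once a TCP connection to (host, port) succeeds, False on timeout."""
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        try:
            # create_connection raises OSError while nothing is listening yet.
            with socket.create_connection((host, port), timeout=1.0):
                return True
        except OSError:
            time.sleep(0.2)
    return False
```

For example, `wait_for_port("127.0.0.1", 9088)` would poll for up to ten seconds, assuming the service is published on the local machine as in the Docker command above.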

Usage

  1. Test with curl
curl --form file='@/path/to/your/pdf' http://0.0.0.0:9088/parse
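The same upload can be sketched in Python with only the standard library. This mirrors the curl call above: one multipart/form-data field named `file` posted to /parse. The function names `build_multipart` and `parse_pdf` are hypothetical, the `127.0.0.1` address assumes the service runs on the local machine, and the response is returned as raw bytes because the README does not describe the response format.

```python
import os
import urllib.request
import uuid


def build_multipart(field: str, filename: str, payload: bytes,
                    content_type: str = "application/pdf"):
    """Build a single-file multipart/form-data body; returns (body, Content-Type header)."""
    boundary = uuid.uuid4().hex
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="{field}"; filename="{filename}"\r\n'
        f"Content-Type: {content_type}\r\n\r\n"
    ).encode("utf-8")
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")
    return head + payload + tail, f"multipart/form-data; boundary={boundary}"


def parse_pdf(path: str, url: str = "http://127.0.0.1:9088/parse") -> bytes:
    """POST the PDF at `path` to the service and return the raw response body."""
    with open(path, "rb") as fh:
        body, ctype = build_multipart("file", os.path.basename(path), fh.read())
    req = urllib.request.Request(
        url, data=body, headers={"Content-Type": ctype}, method="POST"
    )
    with urllib.request.urlopen(req) as resp:
        return resp.read()
```

A call such as `parse_pdf("/path/to/your/pdf")` would then return whatever the server sends back for that file.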

License: Other


Languages

Python 50.1%, Dockerfile 47.9%, Shell 2.0%