DieggoCarvalho / docvert-python3

Docvert for Python3: Converts Office files to DocBook and clean HTML, diagrams to SVG/PNG, etc.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Docvert

Converts Word Processor office files (e.g. .DOC files) to OpenDocument, DocBook, and structured HTML.

This is Docvert for Python 3. To find Docvert for Python 2.x see http://github.com/holloway/docvert/

Web Service

python3 ./docvert-web.py [-p PORT] [-H host]

Command Line

python3 ./docvert-cli.py

usage: docvert-cli.py [-h] [--version] --pipeline PIPELINE
    [--response {auto,path,stdout}]
    [--autopipeline {Break up over Heading 1.default,Nothing one long page}]
    [--url URL]
    [--list-pipelines]
    [--pipelinetype {tests,auto_pipelines,pipelines}]
    infile [infile ...]

Community

http://lists.catalyst.net.nz/mailman/listinfo/docvert

Requirements

Python 3
libreoffice
python3-uno
python3-lxml
python3-imaging
pdf2svg
librsvg2-2

Quickstart Guide

sudo apt-get install libreoffice python3-uno python3-lxml python3-imaging pdf2svg librsvg2-2 librsvg2-bin

/usr/bin/soffice --headless --norestore --nologo --norestore --nofirststartwizard --accept="socket,port=2002;urp;"

then in another terminal

cd ~

git clone git://github.com/holloway/docvert-python3.git

cd docvert-python3

python3 ./docvert-web.py

and browse to http://localhost:8080

LICENCE

Released under the GPL3 see LICENCE

About

Docvert for Python3: Converts Office files to DocBook and clean HTML, diagrams to SVG/PNG, etc.

License:GNU General Public License v3.0


Languages

Language:Python 55.4%Language:XSLT 31.5%Language:CSS 5.0%Language:JavaScript 4.9%Language:Smarty 2.1%Language:Shell 0.9%Language:HTML 0.2%