antoniocoratelli / web-clipper

Easily download the main content of a web page in html, markdown, and/or epub format from command line.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

web-clipper

Easily download the main content of a web page in html, markdown, and/or epub format from command line.

Dependencies

This project was developed on Ubuntu 14.04, but should run smoothly on any Linux distribution with python3 and virtualenv support.

sudo apt-get -y install \
    git-core \
    python3 python3-pip python-virtualenv \
    python-tk openjdk-7-jdk libxml2-dev libxslt1-dev pandoc \
    && \
sudo apt-get -y build-dep \
    python-imaging \
    && \
pip3 install --user --upgrade \
    pip virtualenv

Setup

git clone https://github.com/antoniocoratelli/web-clipper.git && cd web-clipper && ./init-venv.sh

Usage

See script help: ./web-clipper -h

About

Easily download the main content of a web page in html, markdown, and/or epub format from command line.

License:BSD 3-Clause "New" or "Revised" License


Languages

Language:Python 90.7%Language:Shell 9.3%