moozilla / docfuture-scraper

Python tumblr -> epub scraper for docfuture.tumblr.com

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

docfuture-scraper

Python tumblr -> epub scraper for docfuture.tumblr.com.

The author of Doc Future has commented that they may publish the books commerically in the future, so please do not share the epubs created by this scraper publicly.

Useful Links:

Setup

Install dependencies

  • Make sure you have a valid python installation (tested on python 3.12)
  • Create a virtual environment
python3.12 -m venv venv
source venv/bin/activate
  • Install requirements
pip install -r requirements.txt

Create tumblr API key

consumer_key: YOUR_CONSUMER_KEY_HERE
consumer_secret: YOUR_CONSUMER_SECRET_HERE
oauth_token: # not necessary
oauth_token_secret: # not necessary

(Note: if you want an oauth token/secret you can use the interactive console script from pytumblr, but I don't think this is actually necessary for only reading posts)

Running the scraper

python3 scrape.py

This will scrape each post and compile the contents into an epub under output/docfuture.epub.

About

Python tumblr -> epub scraper for docfuture.tumblr.com

License:MIT License


Languages

Language:Python 100.0%