katalyst-labs / uncanny-magazine-crawler

Build an ePub from articles published on www.uncannymagazine.com.

Repository on GitHub: https://github.com/katalyst-labs/uncanny-magazine-crawler

Uncanny Magazine Crawler

Intro

Crawl and scrape articles from www.uncannymagazine.com and convert them into a proper, retail-like ePub file. The free online content of each issue is released in two stages, with a new issue every two months.


Known Issues and Limitations

  • The script uses the EbookLib library to create the final ePub file. The library builds the file paths inside the ePub with os.path, which causes problems on Windows, where os.path uses ' \ ' as the path separator. The script works around this by monkey-patching the os.path calls with their posixpath equivalents.

  • The script does not yet support issues that have not been fully released.

  • Certain articles contain extra spaces or formatting artifacts from lxml. These will be patched at a later date.
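
The path-separator workaround from the first item can be illustrated with a minimal sketch. It is not the script's actual patch: the helper name force_posix_paths and the sample file names are hypothetical, and it swaps os.path globally for the duration of a block, which is only one possible way to monkey-patch the calls.

```python
import contextlib
import ntpath
import os
import posixpath


@contextlib.contextmanager
def force_posix_paths():
    """Temporarily make os.path behave like posixpath (hypothetical helper)."""
    original = os.path
    os.path = posixpath  # every os.path.* call now uses '/' separators
    try:
        yield
    finally:
        os.path = original  # always restore the platform default


# ntpath is the module that os.path resolves to on Windows.
print(ntpath.join("EPUB", "chapter1.xhtml"))       # EPUB\chapter1.xhtml

with force_posix_paths():
    # Inside the block, paths are joined with forward slashes on any platform.
    print(os.path.join("EPUB", "chapter1.xhtml"))  # EPUB/chapter1.xhtml
```

Under this assumption, the ePub-writing call would be wrapped in the with block so that the paths stored in the archive use forward slashes, while the rest of the program keeps the platform's normal os.path behaviour.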


Disclaimer

This software was written for educational and private use.

If you like the content, please support the authors and the site by purchasing a copy or through donations.


Languages

Python 95.2%, HTML 4.1%, CSS 0.7%