bfishadow / SBB

A Python script to download all articles from any Sina Blog blogger.

SBB

SBB (Sina Blog Book) is a script that downloads all articles from a specific Sina Blog blogger.

You can import the downloaded HTML files into Calibre to generate an ebook, or simply keep them anywhere as an archive.
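The Calibre step can also be done without the GUI, using Calibre's `ebook-convert` command-line tool. The following is a minimal sketch and not part of SBB itself: it assumes Calibre is installed and on the `PATH`, and the helper name `build_convert_command`, the output file name and the title are hypothetical.

```python
import subprocess


def build_convert_command(index_html, output_file, title=None):
    """Build an ebook-convert command line (hypothetical helper).

    ebook-convert picks the output format from the output file extension.
    """
    cmd = ["ebook-convert", index_html, output_file]
    if title:
        cmd += ["--title", title]
    return cmd


cmd = build_convert_command("Index.html", "blog.epub", title="Sina Blog Book")
# Uncomment to run the conversion; this requires Calibre to be installed.
# subprocess.check_call(cmd)
```

The command is built as a list rather than a single string so that file names containing spaces do not need shell quoting.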

Tested with Python 2.7.8

Usage

SBB.py (Sina Blog URL) (asc|desc)

The sort order argument is optional. By default, articles are sorted chronologically (asc).
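The argument handling described above could look roughly like the sketch below. It is not taken from SBB.py: the function name `parse_args`, the error messages and the placeholder URL are assumptions, and only the `asc` default and the `asc|desc` choices come from the usage line.

```python
VALID_ORDERS = ("asc", "desc")


def parse_args(argv):
    """Return (blog_url, sort_order) from command-line arguments.

    Hypothetical helper; argv is expected to exclude the script name,
    for example sys.argv[1:].
    """
    if not argv or len(argv) > 2:
        raise ValueError("usage: SBB.py (Sina Blog URL) (asc|desc)")
    blog_url = argv[0]
    # The sort order is optional and defaults to chronological (asc).
    sort_order = argv[1].lower() if len(argv) == 2 else "asc"
    if sort_order not in VALID_ORDERS:
        raise ValueError("sort order must be 'asc' or 'desc'")
    return blog_url, sort_order


# Placeholder URL, used only for illustration.
url, order = parse_args(["http://blog.sina.com.cn/example"])
```

Raising `ValueError` instead of exiting directly keeps the helper easy to call from other code; a command-line wrapper can catch the error and print the usage text.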

Example:

Roadmap

  • Add a datestamp to Index.html
  • 'SELECT * FROM AllBlogPosts ORDER BY DatePosted DESC / ASC'
  • Download embedded pictures
  • Integrate the Calibre library to generate the ebook in one place
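The first two roadmap items above (a datestamp on the index page and date-ordered listing) could be sketched as follows. This is only an illustration under assumptions: the post layout (dicts with `title` and `date_posted` keys) and the function names `sort_posts` and `render_index` are hypothetical and do not describe how SBB.py stores its data.

```python
from datetime import datetime


def sort_posts(posts, order="asc"):
    """Sort post dicts by 'date_posted' (hypothetical data layout)."""
    return sorted(posts, key=lambda p: p["date_posted"],
                  reverse=(order == "desc"))


def render_index(posts, generated_at):
    """Return a minimal index page that starts with a datestamp line.

    Titles are assumed to be HTML-escaped already.
    """
    lines = [
        "<html><body>",
        "<p>Generated on %s</p>" % generated_at.strftime("%Y-%m-%d"),
        "<ul>",
    ]
    for post in posts:
        day = post["date_posted"].strftime("%Y-%m-%d")
        lines.append("<li>%s %s</li>" % (day, post["title"]))
    lines += ["</ul>", "</body></html>"]
    return "\n".join(lines)


posts = [
    {"title": "Second post", "date_posted": datetime(2014, 5, 1)},
    {"title": "First post", "date_posted": datetime(2013, 1, 1)},
]
page = render_index(sort_posts(posts, "asc"), datetime.now())
```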

License

Licensed under the Apache License, Version 2.0

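中文 (Chinese)

SBB (Sina Blog Book) is a script for downloading all articles by a specified Sina Blog author.

From the downloaded HTML files, you can generate an ebook with Calibre, or keep them as an archive.

Use with Python 2.7.8.

Usage

SBB.py (Sina Blog URL) (desc|asc)

The sort switch is optional; the default is chronological order by publication time (asc).

Example:

Roadmap

  • Add a timestamp to the index page
  • 'SELECT * FROM AllBlogPosts ORDER BY DatePosted DESC / ASC'
  • Option to download images as well
  • Integrate the Calibre library to package into .mobi or .ePub in one step

License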

Licensed under the Apache License, Version 2.0
