Darkborderman / pycon_archive_past_website

Freeze and archive historical PyConTW official websites as static sites.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

PRs Welcome Conventional Commits Code style: black

PyCon TW Archived Websites

The resting place for historical PyCon TW websites.

In order to better preserve things, we crawl each year's PyCon TW website into static HTML after the conference ends. The static websites are served on GitHub Page and the requests for historical websites are proxied from tw.pycon.org to GitHub Page with pycontw-nginx. See the crawling script on main branch and the archived HTML on gh-pages branch.

Getting Started

Prerequisites

Usage

  • Installing dependencies

    pipenv install
  • To scrap a specific PyCon TW website, insert the command

    pipenv run python main.py -y [YEAR] --base [BASE]

    where YEAR should be the path of website (e.g. 2020 for https://tw.pycon.org/2020) and BASE for the base URL or base path used in the static HTML.

  • To serve scraped static pages on your local machine, run the command:

    python -m http.server --cgi [PORT]

    and then access localhost:[PORT].

    or run the command:

    pipenv run serve

    and access localhost:5000.

Contributing

See Contributing

Authors

PyCon Taiwan

Created from Lee-W/cookiecutter-python-template version 1.1.2

About

Freeze and archive historical PyConTW official websites as static sites.

License:MIT License


Languages

Language:Python 100.0%