anjackson

Andy Jackson's repositories

anjackson.github.io

My personal website

Language:HTMLNOASSERTION100

the-turing-way

Host repository for The Turing Way: a how to guide for reproducible data science

NOASSERTION000

digipres.github.io

Auto-generated static web site digipres.org

Language:Jupyter Notebook000

codex

Language:Jupyter Notebook000

crawlcache

Experimental crawler proxy using a WARC-based cache.

Language:Python000

notebook

An experiment with managing Markdown docs.

000

wikipedia-sopa-blackout

A web archive of the Wikipedia homepage during the 2012 SOPA Blackout

Language:HTML000

digipres-notebook

Open notebook for digital preservation stuff hosted by GitBook

100

awesome-digital-preservation

Carefully curated list of awesome digital preservation resources.

Language:JavaScriptCC0-1.0000

drumknott

An experimental clerk.

AGPL-3.0000

datasette-lite

Datasette running in your browser using WebAssembly and Pyodide

Apache-2.0000

awesome-web-archiving

An Awesome List for getting started with web archiving

CC0-1.0000

ipywardley

Bringing Wardley Map magic to Jupyter notebooks

Language:Jupyter NotebookGPL-3.02000

ukwa-reports

Generating Reports

Language:TeX000

outbackcdx

Web archive index server based on RocksDB

Apache-2.0000

warcit

Convert Directories, Files and ZIP Files to Web Archives (WARC)

Language:PythonApache-2.0000

golem

Experimental crawler using Scrapy and Selenium

Language:PythonAGPL-3.0000

ukwa-monitor

Dashboard and monitoring system for the UK Web Archive

Language:Python000

browsertrix-crawler

Run a high-fidelity browser-based crawler in a single Docker container

AGPL-3.0000

sphinx-comments

hypothes.is interaction layer with Sphinx

MIT000

rclone-trials

Experimenting with Rclone and how it works with HDFS

Language:Shell000

timewarp

Making it easier to browse the past.

Language:PythonApache-2.0100

cdx-db

Generating Parquet files containing CDX data for SQL queries

Language:Python200

one-click-hugo-cms

Language:CSSMIT000

using-ffmpeg

Containerised ffpmeg and example Jupyter notebooks.

Language:Jupyter NotebookAGPL-3.0000

scrapy-url-frontier

A Scrapy module for URL Frontier integration

Language:PythonBSD-3-Clause100

anjackson

GitHub profile README

CC0-1.0000

ukwa-manage

Shepherding our web archives from crawl to access.

Language:Jupyter NotebookApache-2.0000

ebook-test-manifests

Language:HTML000

ukwa-services

Deployment configuration for all UKWA services stacks.

Language:PythonApache-2.0000