Andy Jackson (anjackson)



Company: The Digital Preservation Coalition

Location: UK

Home Page: anjackson.net

Twitter: @anjacks0n



Organizations
iipc
openpreserve
uberconverter

Andy Jackson's repositories

anjackson.github.io

My personal website

Language: HTML | License: NOASSERTION | Stargazers: 1 | Issues: 0

the-turing-way

Host repository for The Turing Way: a how to guide for reproducible data science

License: NOASSERTION | Stargazers: 0 | Issues: 0

digipres.github.io

Auto-generated static web site digipres.org

Language: Jupyter Notebook | Stargazers: 0 | Issues: 0

crawlcache

Experimental crawler proxy using a WARC-based cache.

Language: Python | Stargazers: 0 | Issues: 0

notebook

An experiment with managing Markdown docs.

Stargazers: 0 | Issues: 0

wikipedia-sopa-blackout

A web archive of the Wikipedia homepage during the 2012 SOPA Blackout

Language: HTML | Stargazers: 0 | Issues: 0

digipres-notebook

Open notebook for digital preservation stuff hosted by GitBook

Stargazers: 1 | Issues: 0

awesome-digital-preservation

Carefully curated list of awesome digital preservation resources.

Language: JavaScript | License: CC0-1.0 | Stargazers: 0 | Issues: 0

drumknott

An experimental clerk.

License: AGPL-3.0 | Stargazers: 0 | Issues: 0

datasette-lite

Datasette running in your browser using WebAssembly and Pyodide

License: Apache-2.0 | Stargazers: 0 | Issues: 0

awesome-web-archiving

An Awesome List for getting started with web archiving

License: CC0-1.0 | Stargazers: 0 | Issues: 0

ipywardley

Bringing Wardley Map magic to Jupyter notebooks

Language: Jupyter Notebook | License: GPL-3.0 | Stargazers: 20 | Issues: 0

ukwa-reports

Generating Reports

Language: TeX | Stargazers: 0 | Issues: 0

outbackcdx

Web archive index server based on RocksDB

License: Apache-2.0 | Stargazers: 0 | Issues: 0

warcit

Convert Directories, Files and ZIP Files to Web Archives (WARC)

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

golem

Experimental crawler using Scrapy and Selenium

Language: Python | License: AGPL-3.0 | Stargazers: 0 | Issues: 0

ukwa-monitor

Dashboard and monitoring system for the UK Web Archive

Language: Python | Stargazers: 0 | Issues: 0

browsertrix-crawler

Run a high-fidelity browser-based crawler in a single Docker container

License: AGPL-3.0 | Stargazers: 0 | Issues: 0

sphinx-comments

hypothes.is interaction layer with Sphinx

License: MIT | Stargazers: 0 | Issues: 0

rclone-trials

Experimenting with Rclone and how it works with HDFS

Language: Shell | Stargazers: 0 | Issues: 0

timewarp

Making it easier to browse the past.

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 0

cdx-db

Generating Parquet files containing CDX data for SQL queries

Language: Python | Stargazers: 2 | Issues: 0
Language: CSS | License: MIT | Stargazers: 0 | Issues: 0

using-ffmpeg

Containerised ffmpeg and example Jupyter notebooks.

Language: Jupyter Notebook | License: AGPL-3.0 | Stargazers: 0 | Issues: 0

scrapy-url-frontier

A Scrapy module for URL Frontier integration

Language: Python | License: BSD-3-Clause | Stargazers: 1 | Issues: 0

anjackson

GitHub profile README

License: CC0-1.0 | Stargazers: 0 | Issues: 0

ukwa-manage

Shepherding our web archives from crawl to access.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 0 | Issues: 0
Language: HTML | Stargazers: 0 | Issues: 0

ukwa-services

Deployment configuration for all UKWA services stacks.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0