UK Web Archive (ukwa)

Location: United Kingdom

Home Page: http://www.webarchive.org.uk/

UK Web Archive's repositories

webarchive-discovery

WARC and ARC indexing and discovery tools.

docker-pdf2htmlex

Run pdf2htmlEX in a Docker container.

Language: Python | License: Apache-2.0 | Stargazers: 21 | Issues: 6 | Issues: 1

w3act

w3act is an annotation and curation tool for building web archive collections

Language: Java | License: Apache-2.0 | Stargazers: 19 | Issues: 14 | Issues: 667
Language: JavaScript | License: GPL-3.0 | Stargazers: 11 | Issues: 13 | Issues: 106

ukwa-manage

Shepherding our web archives from crawl to access.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 10 | Issues: 9 | Issues: 85

ukwa-heritrix

The UKWA Heritrix3 custom modules and Docker builder.

acid-crawl

An acid test suite for crawlers.

hapy

A Python wrapper around the Heritrix API.

Language: Python | License: Apache-2.0 | Stargazers: 4 | Issues: 9 | Issues: 0

ukwa-services

Deployment configuration for all UKWA services stacks.

Language: Python | License: Apache-2.0 | Stargazers: 4 | Issues: 8 | Issues: 86

webrender-puppeteer

Web page rendering service based on Google's Puppeteer

crawl-db

A standalone database for crawl events.

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 8 | Issues: 2

docker-airflow

Apache Airflow with a few additional dependencies

Language: Dockerfile | License: Apache-2.0 | Stargazers: 1 | Issues: 8 | Issues: 0

docker-hadoop

Hadoop running in a container.

Language: Dockerfile | License: Apache-2.0 | Stargazers: 1 | Issues: 8 | Issues: 1

python-w3act

Python clients for W3ACT and Heritrix3

ukwa-ui

A new user interface for the UK Web Archive

Language: Java | License: BSD-3-Clause | Stargazers: 0 | Issues: 11 | Issues: 316

backstage

UI for searching across internal services

Language: Ruby | Stargazers: 0 | Issues: 8 | Issues: 1

crawl-log-viewer

A simple web service for viewing crawl logs.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

crawl-streams

Tools for working with UKWA crawler event streams

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 8 | Issues: 8

docker-clamd

ClamD in a container

Language: Dockerfile | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

docker-robot-framework

A Dockerised Robot Framework execution environment.

Language: RobotFramework | License: Apache-2.0 | Stargazers: 0 | Issues: 7 | Issues: 12

docker-superset

Dockerized Apache Superset including Solr module

Language: Shell | Stargazers: 0 | Issues: 7 | Issues: 1

kevals

Key-values data aggregator

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 8 | Issues: 0

npld-access-stack

Service deployment setup for the Reading Room NPLD Access service

Language: HTML | Stargazers: 0 | Issues: 6 | Issues: 3

npld-player

Secured browser for accessing NPLD content in Legal Deposit Library reading rooms.

Language: TypeScript | License: MIT | Stargazers: 0 | Issues: 8 | Issues: 15
Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

ukwa-monitor

Dashboard and monitoring system for the UK Web Archive

Language: Python | Stargazers: 0 | Issues: 9 | Issues: 22

ukwa-notebook-apps

UKWA web apps for working with internal APIs, built on Jupyter notebooks and Voila.

Language: Jupyter Notebook | Stargazers: 0 | Issues: 7 | Issues: 2

ukwa-reports

Generating Reports

Language: TeX | Stargazers: 0 | Issues: 8 | Issues: 0

ukwa-site

Using static site generation for parts of our site.

Language: SCSS | License: Apache-2.0 | Stargazers: 0 | Issues: 8 | Issues: 14

ukwa-ui-collections-solr

Containerised version of the Solr service used to generate the UKWA UI collections browser

Language: Python | Stargazers: 0 | Issues: 7 | Issues: 0