Web archive of National Library of the Czech Republic (WebarchivCZ)

Location: Prague

Home Page: http://www.webarchiv.cz

Web archive of National Library of the Czech Republic's repositories

Seeder

Seeder - Czech webarchive curating tool and public site

Language: Python, License: MIT, Stargazers: 15, Issues: 8, Issues: 568

Crawler-config

WebArchiv.cz crawler configuration.

Language: PHP, License: NOASSERTION, Stargazers: 3, Issues: 2, Issues: 4

wa-tools

Scripts for managing web archive

Language: Shell, Stargazers: 3, Issues: 5, Issues: 0

WAHarvester

Tool for managing crawls with Heritrix and WAdmin tool.

Language: Groovy, License: NOASSERTION, Stargazers: 3, Issues: 2, Issues: 0

extinct-websites

The application is an automated solution for identifying and describing dead websites. It stores them in its own database and makes them available to curators, who work further with the information in it, interpret it, and classify the content.

WA-KAT

Cataloguing tool for the Czech web archive.

Language: JavaScript, License: MIT, Stargazers: 2, Issues: 10, Issues: 106

WAmetadataHarvest

Project focused on harvesting metadata from Heritrix logs/archives and WA-admin tool.

Language: PHP, Stargazers: 2, Issues: 5, Issues: 0

webanalyzer

webanalyzer

Language: Java, License: GPL-3.0, Stargazers: 2, Issues: 2, Issues: 0

naki

NAKI information page

Language: HTML, Stargazers: 1, Issues: 7, Issues: 0

opensearch-to-srw-gate

OpenSearch to SRU/SRW gateway

Language: Java, License: NOASSERTION, Stargazers: 1, Issues: 2, Issues: 0

OpenWayback-devel

Webarchiv's Wayback Machine

databaze-mrtvych-webovych-zdroju

Documentation for the Database of Dead Web Resources (Databáze mrtvých webových zdrojů).

Stargazers: 0, Issues: 4, Issues: 0

grainery

Keeps track of harvested ARC/WARC files and related files such as logs and CDX files.

Language: HTML, License: MIT, Stargazers: 0, Issues: 6, Issues: 5

katalogizacni-manual

Cataloguing manual for describing electronic online resources in the MARC 21 format according to the RDA rules

Language: CSS, License: MIT, Stargazers: 0, Issues: 7, Issues: 0

pywb

A new era of access to the Czech web archive.

Language: Shell, Stargazers: 0, Issues: 4, Issues: 13

WACloud

Centralised interface for Webarchive data extraction and analysis

Language: TypeScript, License: GPL-3.0, Stargazers: 0, Issues: 4, Issues: 0

WACloud_Docs

User documentation for WACloud

License: GPL-3.0, Stargazers: 0, Issues: 3, Issues: 0

WWW

Legacy version of the Czech web archive website

Language: HTML, Stargazers: 0, Issues: 5, Issues: 3

cdx-server

OpenWayback CDX Server build

Stargazers: 0, Issues: 3, Issues: 0

continuous-suite

Continuous Heritrix shell suite (CHSS)

Language: Shell, License: GPL-3.0, Stargazers: 0, Issues: 1, Issues: 0

elasticsearch

Custom Elasticsearch for webarchiv.cz services.

Language: Dockerfile, Stargazers: 0, Issues: 3, Issues: 0

machines

Code for web archiving infrastructure

Language: Ruby, Stargazers: 0, Issues: 5, Issues: 6
Language: XSLT, Stargazers: 0, Issues: 5, Issues: 0

WACloud_ArchiveProcessor

Analytical component of WACloud

Language: Python, Stargazers: 0, Issues: 4, Issues: 0

WACloud_ExportApp

WARC Export application

Language: Python, License: GPL-3.0, Stargazers: 0, Issues: 3, Issues: 0
Language: Java, License: NOASSERTION, Stargazers: 0, Issues: 2, Issues: 0
Language: CSS, Stargazers: 0, Issues: 3, Issues: 0

WebBEAT

WebBEAT website data extractor

Language: Shell, License: GPL-3.0, Stargazers: 0, Issues: 1, Issues: 0