National Library of the Netherlands / Research's repositories
xml-workshop
Automatically extract text, layout and metadata information from XML-files of OCR-ed historical texts
DBNL-canonicity
KB RiR project to Collect a corpus of Dutch novels 1800-2000 and Investigate Canonicity
textExtractDemo
Text extraction demo
Demosaurus
Demo web application that supports author attribution (thesaureren) and topic attribution (subject indexing). Annif is used for the latter.
IwI22_ARTIST
This repository contains the Jupyter Notebooks and other information as created during ICT With Industry 2022
zenodoReports
Fetch metadata and generate reports for a Zenodo community.
isbnlib-kb
A metadata plugin for isbnlib using the service of the KB (National Library of the Netherlands).
OpenRefine-Wikibase
Files for interaction between OpenRefine and KB Wikibases
detectStorageMediaType
Storage media type detection using Python and the Windows API
epub2to3
Epub 2 to Epub 3 conversion workflow
heritrix3-crawler-status-reporting-fix
Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.
jhove-rest
REST Wrappings for JHOVE
pdf-characterisation
Scripts and raw results of PDF characterisation experiments
SANE-blind
A repository to try out the SANE environment
wikibase-api
📦 Wrapper Python library for the Wikibase API
xxLINK-resources
Documentation and scripts related to xxLINK web sites recovery efforts