National Library of the Netherlands / Research's repositories
KB-python-API
Python API for KB data-services
omSipCreator
Create ingest-ready SIPs from batches of optical media images
openjpeg-decoder-service
A java based jp2 decoder service.
detectDamagedAudio
Tests on how to detect damaged WAV files
xs4all-resources
Scripts and documentation related to the xs4all homepage rescue efforts
Annif_data_exp
Automatic subject assignment for KB ebooks using Annif.
EntangledHistories
Processing of Transkribus output using xslt and running it through Annif
Europeana-Full-Text-in-Python
Various Python scripts to assist with searching and downloading full text records via the Europeana APIs.
Annif
Annif is a multi-algorithm automated classification and subject indexing tool for libraries, archives and museums. This repository is used for developing a production version of the system, based on ideas from the initial prototype.
Annif-documentation
Reports about the experiments using Annif
BERT-NER
Pytorch-Named-Entity-Recognition-with-BERT
Brinkeys
Automatic metadating: assigning Brinkman keywords
Brinkman-catalogus
The data and code accomanying my research master thesis: Exploring text mining techniques tostructure a digitised catalogue.
children_book_data
The data and the annotations from crowdsourcing are stored in this repository.
dbnl
Scripts to work with the Public Domain files of DBNL: https://www.dbnl.org/letterkunde/pd/index.php
dbnl-scripts
Scripts to scrape DBNL and work with the texts.
delpher_demo
This repository contains Jupyter Notebooks, code and a test data set to replicate the analyses of the website http://delpher_demo.kbresearch.nl
digger
DIGGER dataset code
gdmodule
Python GD module, originally by Richard Jones
geolocatedomains
Geolocate list of web domains
iromlab-socketclient
Socket client demo for Iromlab
mobile-apps
Resources, documentation on long-term preservation and access mobile apps
SaveToWaybackMachine
Saving URLs of Leesplein.nl to Wayback Machine of The Internet Archive
WordEmbeddingPlayground
Repository made by Kaspar van Beelen during his research-in-residence at the KB.