kbrbe / beltrans-data-integration

Creating a FAIR Linked Data corpus for the BELTRANS research project about Belgian book translations NL-FR and FR-NL between 1970 and 2020

Home Page:https://www.kbr.be/en/projects/beltrans/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

BELTRANS data integration

This repository contains code for the data integration of the BELTRANS project which studies Intra-Belgian translation flows between French and Dutch in the period 1970-2020. Data from different heterogenous data sources is integrated to create a FAIR corpus, this includes XML files from different sources but also existing large RDF dumps.

Preprocessing scripts for the different data sources are stored in the respective data-source folder. This mainly includes Python scripts but also RML mapping documents. The data integration is currently controlled by a bash script in the data-integration folder.

BELTRANS data integration overview

About

Creating a FAIR Linked Data corpus for the BELTRANS research project about Belgian book translations NL-FR and FR-NL between 1970 and 2020

https://www.kbr.be/en/projects/beltrans/

License:MIT License


Languages

Language:Jupyter Notebook 74.7%Language:Python 20.2%Language:Shell 5.0%Language:Dockerfile 0.0%