Thorben Schomacker's repositories
aligned-narrative-documents
A collection of scripts to create a Document-aligned corpus of German Narrative Texts from four different sources of Simple Language Texts and three different sources of Standard Language Texts.
churchtools-birthdays
Simple tool for automatically sending a list of people which had their birthdays within the last week generated from a churchtools database
generalizing-passages-identification-bert
Automatic Identification of Generalizing Passages in German Fictional Texts using BERT with Monolingual and Multilingual Training Data
news-scraper
A program for downloading online articles and saving it in a SQLLite database.
pyrouge-first-use
First Use of Rouge 1.5.5 / pyrouge in Python
textgrid-domain-adaptation-dataset
A small script to mask textrgrid texts sentence by sentence and combine them into one dataset. This dataset can be used for masked language modeling and thus for pre-training and domain adaptation.