Thorben Schomacker (tschomacker)

tschomacker

Geek Repo

Location:Hamburg, Deutschland

Github PK Tool:Github PK Tool

Thorben Schomacker's repositories

aligned-narrative-documents

A collection of scripts to create a Document-aligned corpus of German Narrative Texts from four different sources of Simple Language Texts and three different sources of Standard Language Texts.

Language:PythonLicense:CC-BY-4.0Stargazers:2Issues:1Issues:0

BertSum

A fork of BertSum which uses Stanford Stanza for tokenizing. This makes it possible to tokenize a big variety of languages.

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

churchtools-birthdays

Simple tool for automatically sending a list of people which had their birthdays within the last week generated from a churchtools database

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0
Language:HTMLStargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:1Issues:0

generalizing-passages-identification-bert

Automatic Identification of Generalizing Passages in German Fictional Texts using BERT with Monolingual and Multilingual Training Data

Language:PythonLicense:MITStargazers:0Issues:3Issues:0
Language:HTMLLicense:MITStargazers:0Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

news-scraper

A program for downloading online articles and saving it in a SQLLite database.

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

pyrouge-first-use

First Use of Rouge 1.5.5 / pyrouge in Python

Language:PythonStargazers:0Issues:1Issues:0

textgrid-domain-adaptation-dataset

A small script to mask textrgrid texts sentence by sentence and combine them into one dataset. This dataset can be used for masked language modeling and thus for pre-training and domain adaptation.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0
Stargazers:0Issues:1Issues:0