ÚFAL's repositories
marian-tensorboard
a simple tool to parse marian training logs and display them in tensorboard
perl-pmltq
Query engine and query language for trees in PML format
ambiguity-grammaticality-complexity
Code for the paper Sentence Ambiguity, Grammaticality and Complexity Probes
eyetracked-multi-modal-translation
EMMT (Eyetracked Multi-Modal Translation), a simultaneous eye-tracking, 4-electrode EEG and audio corpus for multi-modal reading and translation scenarios
lindat-aai-attributes
Parse shibboleth logs for important information about attributes from IdPs and other
uk-cs-data-scripts
Scripts for processing data for Czech-Ukrainian MT
europarlmin
Corpus of European Parliament debates organized as a corpus for meeting summarization, i.e. matching full transcripts and minutes from the sessions. Used in the shared task of AutoMin 2023.
wmt22-term-based-metric
WMT22 Test Suite on terminology
GEM-metrics
Automatic metrics for GEM tasks
lindat-aai-attribute-aggregator
CLARIN+ AAI released attribute aggregator
oplatek-clustershgit
Configs, tips, and tricks how to work on the Ufal cluster. See README-ufal.md!
UD_Czech-Poetry
manual annotation of 19th-century Czech poetry
UD_Czech-Poetry-1
manual annotation of 19th-century Czech poetry
zotero-to-riv
Bibliography reporting for LINDAT