MaCoCu's repositories
LanguageModels
Tools for training LMs
BCMS-variant-classifier
A classification tool for discriminating between Bosnian, Croatian, Montenegrin, and Serbian
DSI
Code for the DSI experiments in the MaCoCu project
Language:Python000
HT-vs-MT
Source code for EAMT 2022 paper "Automatic Discrimination of Human and Neural Machine Translation: A Study with Multiple Pre-Trained Models and Longer Context".
Language:ShellMIT000
Manual-Checking-Web-Corpora-Guidelines
The Guidelines for Manual Checking of Web Corpora
Monolingual-Curation
The Repository for the Curation of Monolingual Data work package