Andreas Büttner's repositories
pagedir2pagexml
Command-line tool to integrate ocropus results and ground truth in PageXML files
latin-bert-huggingface
Tokenizer config files to integrate Latin BERT in 🤗 transformers
ors2bryton
Convert routes from openrouteservice for Bryton devices
pagexmllineseg
Some Python functions to put text lines in LAREX PageXML files
altusi
The Arabic-Latin translations unified study interface
ArabicSOS
Segmenter and Orthography Standardizer (SOS) for Classical Arabic (CA)
calamari_demo
Instructional materials for the calamari OCR engine
cltk
The Classical Language Toolkit
csmtiser
A tool for text normalisation via character-level machine translation
HTR-models-es
Handwritten Text Recognition models for different historical collections
latinlp
Docker image for some Latin NLP tools
LEMLAT3
Morphological analyzer and lemmatizer for Latin.
morpheus
Morpheus parser
neuspell
NeuSpell: A Neural Spelling Correction Toolkit
punctuation-restoration
Punctuation Restoration using Transformer Models for High- and Low-Resource Languages
pydelta
An experimental implementation of Burrows's Delta in Python 3
vdhd-2021-05-05
Demos for OCR-D presentation at OCR@vDHd
vscode-xml
XML Tools for Visual Studio Code