Andreas Büttner (andbue)

andbue

Geek Repo

Github PK Tool:Github PK Tool


Organizations
Calamari-OCR

Andreas Büttner's repositories

nashi

Some bits of javascript to transcribe scanned pages using PageXML

Language:HTMLLicense:GPL-3.0Stargazers:17Issues:10Issues:3

kraken

Kraken fork using pytorch and warp-ctc instead of clstm

Language:PythonLicense:Apache-2.0Stargazers:5Issues:0Issues:0

pagedir2pagexml

Command line tool to integrate ocropus results and ground truth in PageXML files

Language:PythonLicense:GPL-3.0Stargazers:3Issues:3Issues:0

latin-bert-huggingface

Tokenizer config files to integrate Latin BERT in 🤗 transformers

Language:ShellLicense:MITStargazers:2Issues:2Issues:0

ors2bryton

Convert routes from openrouteservice for bryton devices

Language:PythonLicense:UnlicenseStargazers:1Issues:3Issues:1

page2tei

Python snippets that might be useful for exporting transcribed pages from PAGE XML to TEI XML

Language:PythonLicense:Apache-2.0Stargazers:1Issues:4Issues:0

pagexmllineseg

Some python functions to put text lines in LAREX PageXML files

Language:PythonLicense:GPL-3.0Stargazers:1Issues:3Issues:0

altusi

the arabic-latin translations unified study interface

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

ArabicSOS

Segmenter and Orthography Standardazier (SOS) for Classical Arabic (CA)

Language:PythonLicense:LGPL-3.0Stargazers:0Issues:0Issues:0

Bleualign

Machine-Translation-based sentence alignment tool for parallel text

Language:PythonLicense:GPL-2.0Stargazers:0Issues:1Issues:0

calamari

OCR Engine based on OCRopy and Kraken

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

calamari_demo

Instructional materials for the calamari OCR engine

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:2Issues:0

cltk

The Classical Language Toolkit

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

csmtiser

A tool for text normalisation via character-level machine translation

License:LGPL-3.0Stargazers:0Issues:0Issues:0

HTR-models-es

Handwritten Text Recognition models for different historical collections

Stargazers:0Issues:0Issues:0

LAREX

A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books.

Language:JavaScriptLicense:GPL-3.0Stargazers:0Issues:2Issues:0

LAREXjs

JS port of the semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books.

Language:JavaScriptLicense:GPL-3.0Stargazers:0Issues:2Issues:0

latinlp

Docker image for some Latin NLP tools

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

LEMLAT3

Morphological analyzer and lemmatizer for Latin.

Language:CStargazers:0Issues:0Issues:0

morpheus

Morpheus parser

Language:CStargazers:0Issues:0Issues:0

neuspell

NeuSpell: A Neural Spelling Correction Toolkit

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

norma

A tool for automatic spelling normalization

Language:C++License:LGPL-3.0Stargazers:0Issues:1Issues:0

punctuation-restoration

Punctuation Restoration using Transformer Models for High-and Low-Resource Languages

License:MITStargazers:0Issues:0Issues:0

pydelta

an experimental implementation of Burrow's delta in Python 3

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:0Issues:0Issues:0

vdhd-2021-05-05

Demos for OCR-D presentation at OCR@vDHd

Stargazers:0Issues:1Issues:0

vscode-xml

XML Tools for Visual Studio Code

Language:TypeScriptLicense:MITStargazers:0Issues:1Issues:0
Language:AdaStargazers:0Issues:0Issues:0