Robert Sachunsky's repositories

sbb_web-integration

Visualization of NER+EL+Topic Modelling + Image-Search

Stargazers:0Issues:0Issues:0

sbb_images

Annotation Tool and Image Search

Stargazers:0Issues:0Issues:0

page2tsv

PAGE-XML to TSV

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

sbb_utils

shared functionality

Stargazers:0Issues:0Issues:0

sbb_tools

Digitalized Collections of the Berlin State Library: ALTO-XML Processing Tools / batch NER + EL / BERT-pre-training

Stargazers:0Issues:0Issues:0

sbb_topic-modelling

Topic Modelling

Stargazers:0Issues:0Issues:0

sbb_knowledge-base

Wikidata + Wikipedia Knowledge-Base Extraction for EL-purposes

Stargazers:0Issues:0Issues:0

sbb_ocr_postcorrection

Two-Step Approach to OCR Post-Correction

License:Apache-2.0Stargazers:0Issues:0Issues:0

sbb_ned

Named Entity Disambiguation and Linking

License:Apache-2.0Stargazers:0Issues:0Issues:0

sbb_ner

Named Entity Recognition

License:Apache-2.0Stargazers:0Issues:0Issues:0

dh-datenkompetenz2024-ocr

Slides and materials for contribution to the Ringvorlesung DH in SS 24 at TUD

Language:CSSLicense:CC0-1.0Stargazers:0Issues:0Issues:0

gt-repo-scripts

XSLT and shell scripts for analyzing and creating GitHub pages of a ground truth repository. These are centrally managed and can be used by all repositories created with gt-repo-template (https://github.com/OCR-D/gt-repo-template).

License:CC-BY-SA-4.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

ocrd-demo-2021-05-12

Demos for OCR-D presentation at OCR@vDHd

Language:HTMLStargazers:1Issues:0Issues:0

tesstrain

Train Tesseract LSTM with make

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

dta-tools

Tools used in the project "Deutsches Textarchiv"

License:LGPL-3.0Stargazers:0Issues:0Issues:0

ocrd_monitor

Web frontend for ocrd_manager

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

tessdoc

Tesseract documentation

Stargazers:0Issues:0Issues:0

tesserocr

A Python wrapper for the tesseract-ocr API

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

tesseract

Tesseract Open Source OCR Engine (main repository)

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

dta-lexdb-applications

formatting and integrating the Deutches Textarchiv dictionary into various applications

Language:MakefileStargazers:2Issues:0Issues:0

mkn-test-gt

meine DHd24-GT-Erfahrung

License:CC-BY-SA-4.0Stargazers:0Issues:0Issues:0

mygt

mydesc

License:CC-BY-SA-4.0Stargazers:0Issues:0Issues:0

htr-united

Ground Truth Resources for the HTR of patrimonial documents

License:CC0-1.0Stargazers:0Issues:0Issues:0
Language:ShellLicense:MITStargazers:0Issues:0Issues:0

ocrd_keraslm

Simple character-based language model using keras

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Mask_RCNN

Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow

Language:PythonLicense:NOASSERTIONStargazers:3Issues:0Issues:0

kraken

OCR engine for all the languages

License:Apache-2.0Stargazers:0Issues:0Issues:0

ddb-metadata-schematron-validation

Schematron-Validierungen der Fachstelle Bibliothek der Deutsche Digitalen Bibliothek

License:Apache-2.0Stargazers:0Issues:0Issues:0
Language:XSLTLicense:MITStargazers:0Issues:0Issues:0