Robert Sachunsky's repositories

ocrd_detectron2

OCR-D wrapper for detectron2 based segmentation models

workflow-configuration

a makefilization for OCR-D workflows, with configuration examples

Language:MakefileLicense:Apache-2.0Stargazers:9Issues:4Issues:19

mkn-kurrent-gt

Kurrent GT from the Moravian Knowledge Network handwritten periodicals

License:CC-BY-SA-4.0Stargazers:2Issues:2Issues:0

ocrd-demo-2021-05-12

Demos for OCR-D presentation at OCR@vDHd

Language:HTMLStargazers:1Issues:0Issues:0

browse-ocrd

An extensible viewer for OCR-D mets.xml files

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

core

Collection of OCR-related python tools and wrappers from @OCR-D

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

dh-datenkompetenz2024-ocr

Slides and materials for contribution to the Ringvorlesung DH in SS 24 at TUD

Language:CSSLicense:CC0-1.0Stargazers:0Issues:0Issues:0

dta-tools

Tools used in the project "Deutsches Textarchiv"

Language:XSLTLicense:LGPL-3.0Stargazers:0Issues:0Issues:0

eynollah

Document Layout Analysis

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

gt-repo-scripts

XSLT and shell scripts for analyzing and creating GitHub pages of a ground truth repository. These are centrally managed and can be used by all repositories created with gt-repo-template (https://github.com/OCR-D/gt-repo-template).

License:CC-BY-SA-4.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

gt_structure_text

The OCR-D Ground Truth text and structure corpus was created between 2015 -2017. In the years since 2017, this corpus has been further curated and supplemented with metadata where appropriate. The corpus includes page XML files within annotations of the text and structure include.

License:CC-BY-SA-4.0Stargazers:0Issues:0Issues:0

ocrd_all

Master repository which includes most other OCR-D repositories as submodules

Language:MakefileLicense:MITStargazers:0Issues:1Issues:0

ocrd_cis

improved Ocropy1 and Post-correction

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

ocrd_kraken

Wrapper for the kraken OCR engine

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

ocrd_tesserocr

Run tesseract with the tesserocr bindings with @OCR-D's interfaces

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

page2tsv

PAGE-XML to TSV

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

sbb_images

Annotation Tool and Image Search

Stargazers:0Issues:0Issues:0

sbb_knowledge-base

Wikidata + Wikipedia Knowledge-Base Extraction for EL-purposes

Stargazers:0Issues:0Issues:0

sbb_ned

Named Entity Disambiguation and Linking

License:Apache-2.0Stargazers:0Issues:0Issues:0

sbb_ner

Named Entity Recognition

License:Apache-2.0Stargazers:0Issues:0Issues:0

sbb_ocr_postcorrection

Two-Step Approach to OCR Post-Correction

License:Apache-2.0Stargazers:0Issues:0Issues:0

sbb_textline_detection

Detect textlines in document images

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

sbb_tools

Digitalized Collections of the Berlin State Library: ALTO-XML Processing Tools / batch NER + EL / BERT-pre-training

Stargazers:0Issues:0Issues:0

sbb_topic-modelling

Topic Modelling

Stargazers:0Issues:0Issues:0

sbb_utils

shared functionality

Stargazers:0Issues:0Issues:0

sbb_web-integration

Visualization of NER+EL+Topic Modelling + Image-Search

Stargazers:0Issues:0Issues:0

tesstrain

Train Tesseract LSTM with make

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

ulb-groundtruth-eval-odem-ger

OCR Grountruth ULB VD18 German Fraktur - OCR-D Phase III

License:CC-BY-SA-4.0Stargazers:0Issues:0Issues:0

ulb-groundtruth-eval-odem-lat

OCR Groundtruth ULB VD18 Latin - OCR-D Phase III

License:CC-BY-SA-4.0Stargazers:0Issues:0Issues:0