QURATOR-SPK (qurator-spk)

QURATOR-SPK

qurator-spk

Geek Repo

IDM4 Data Science @StaatsbibliothekBerlin

Location:Berlin

Home Page:https://ravius.sbb.berlin

Github PK Tool:Github PK Tool

QURATOR-SPK's repositories

eynollah

Document Layout Analysis

Language:PythonLicense:Apache-2.0Stargazers:315Issues:18Issues:77

sbb_textline_detection

Detect textlines in document images

Language:PythonLicense:Apache-2.0Stargazers:81Issues:10Issues:30

sbb_binarization

Document Image Binarization

Language:PythonLicense:Apache-2.0Stargazers:66Issues:6Issues:30

dinglehopper

An OCR evaluation tool

Language:PythonLicense:Apache-2.0Stargazers:55Issues:5Issues:76

neat

Named entity annotation tool

Language:JavaScriptLicense:Apache-2.0Stargazers:27Issues:6Issues:47

sbb_ner

Named Entity Recognition

Language:PythonLicense:Apache-2.0Stargazers:15Issues:7Issues:3

sbb_ned

Named Entity Disambiguation and Linking

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:13Issues:4Issues:3

sbb_ocr_postcorrection

Two-Step Approach to OCR Post-Correction

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:13Issues:4Issues:5

sbb_images

Annotation Tool and Image Search

mods4pandas

Extract the MODS/ALTO metadata of a bunch of METS/ALTO files into pandas DataFrames for data analysis

Language:PythonLicense:Apache-2.0Stargazers:10Issues:4Issues:30

sbb_pixelwise_segmentation

Pixelwise segmentation for document images

Language:PythonLicense:Apache-2.0Stargazers:10Issues:5Issues:12

ocrd-galley

A Dockerized test environment for OCR-D processors 🚢

Language:ShellLicense:Apache-2.0Stargazers:7Issues:5Issues:75

page2tsv

PAGE-XML to TSV

Language:PythonLicense:Apache-2.0Stargazers:4Issues:5Issues:7

sbb_column_classifier

Get the number of columns for a document image

Language:PythonLicense:Apache-2.0Stargazers:3Issues:4Issues:8

ocrd_repair_inconsistencies

Automatically re-order lines, words and glyphs to become textually consistent with their parents.

Language:PythonLicense:Apache-2.0Stargazers:2Issues:3Issues:6

ocrd_trocr

OCR-D processor for TrOCR

Language:PythonLicense:Apache-2.0Stargazers:2Issues:0Issues:0

publications

Qurator-SPK team publications

ocrd_calamari

Recognize text using Calamari OCR and the OCR-D framework

Language:PythonLicense:Apache-2.0Stargazers:1Issues:1Issues:0

PyTorch-YOLOv3

Minimal PyTorch implementation of YOLOv3

Language:PythonLicense:GPL-3.0Stargazers:1Issues:0Issues:0

sbb_knowledge-base

Wikidata + Wikipedia Knowledge-Base Extraction for EL-purposes

Language:PythonStargazers:1Issues:2Issues:0

abbyy-to-alto

Converts FineReader abbyy.xml to alto.xml.

License:MITStargazers:0Issues:0Issues:0

core

Collection of OCR-related python tools and wrappers from @OCR-D

License:Apache-2.0Stargazers:0Issues:0Issues:0

download-gitter.im-chat

tiny tool to download gitter.im chat

Language:PerlLicense:GPL-3.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:2Issues:0

sbb_tools

Digitalized Collections of the Berlin State Library: ALTO-XML Processing Tools / batch NER + EL / BERT-pre-training

Language:PythonStargazers:0Issues:3Issues:0

sbb_topic-modelling

Topic Modelling

Language:PythonStargazers:0Issues:2Issues:0

sbb_utils

shared functionality

Language:PythonStargazers:0Issues:2Issues:0

sbb_web-integration

Visualization of NER+EL+Topic Modelling + Image-Search

Language:JavaScriptStargazers:0Issues:0Issues:0

setuptools_ocrd

Manage your package version through ocrd-tool.json

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0