RISE-UNIBAS / transkribus-extract-textual-tags

Extract textual tags from Transkribus PAGE XML to CSV

transkribus-extract-textual-tags

Python module to extract textual tags from Transkribus PAGE XML to CSV.

Creator

This software and sample dataset were created by the University of Basel's Research and Infrastructure Support (RISE, rise@unibas.ch) in November 2022.

Quickstart

Export your Transkribus documents as PAGE XML to a folder, then extract the textual tags by running:

from transkribus_extract_textual_tags.client import Client

# Read every PAGE XML file in the export folder and write the tags to one CSV file.
Client.extract_from_dir(dir_path="my/transkribus/document/folder",
                        save_file_path="my/output/file.csv")
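
For readers unfamiliar with the input format, here is a minimal sketch of what extracting textual tags from PAGE XML can involve. It is not this module's implementation. It assumes the usual Transkribus convention that tags are stored in the `custom` attribute of each `TextLine` as entries such as `person {offset:0; length:11;}`. The sample document, the `extract_tags` helper, and the output keys (`line_id`, `tag`, `text`) are hypothetical and chosen only for illustration.

```python
import re
import xml.etree.ElementTree as ET

# Matches one "name {key:value; ...}" entry in a `custom` attribute.
TAG_PATTERN = re.compile(r"(\w+)\s*\{([^}]*)\}")

# Hypothetical, heavily shortened PAGE XML document for illustration only.
SAMPLE_PAGE_XML = """
<PcGts xmlns="http://schema.primaresearch.org/PAGE/gts/pagecontent/2013-07-15">
  <Page imageFilename="sample.jpg" imageWidth="100" imageHeight="100">
    <TextRegion id="r1">
      <TextLine id="l1" custom="readingOrder {index:0;} person {offset:0; length:11;} place {offset:24; length:5;}">
        <TextEquiv><Unicode>Jacob Meyer was born in Basel</Unicode></TextEquiv>
      </TextLine>
    </TextRegion>
  </Page>
</PcGts>
"""


def extract_tags(page_xml):
    """Return one dict per textual tag found on the TextLine elements."""
    root = ET.fromstring(page_xml.strip())
    # Take the namespace from the root tag instead of hard-coding a schema version.
    ns = root.tag[: root.tag.index("}") + 1] if root.tag.startswith("{") else ""
    rows = []
    for line in root.iter(f"{ns}TextLine"):
        text = line.findtext(f"{ns}TextEquiv/{ns}Unicode", default="")
        for name, body in TAG_PATTERN.findall(line.get("custom", "")):
            if name == "readingOrder":
                continue  # structural metadata, not a textual tag
            # Turn "offset:0; length:11;" into {"offset": "0", "length": "11"}.
            attrs = {}
            for part in body.split(";"):
                if ":" in part:
                    key, value = part.split(":", 1)
                    attrs[key.strip()] = value.strip()
            offset = int(attrs.get("offset", 0))
            length = int(attrs.get("length", 0))
            rows.append({
                "line_id": line.get("id"),
                "tag": name,
                "text": text[offset:offset + length],
            })
    return rows


rows = extract_tags(SAMPLE_PAGE_XML)
for row in rows:
    print(row["line_id"], row["tag"], row["text"])
```

Each resulting dict maps naturally onto one CSV row, for example via `csv.DictWriter` from the standard library. The columns the module itself writes may differ from the keys used in this sketch.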

License

This work is licensed under a Creative Commons Attribution 4.0 International License.
