Katabase's repositories
3_WikidataEnrichment
align manuscript authors with wikidata entities, create a database on those through sparql, add the wikidata ids to the catalogues
Catalogues
Specifications and example for encoding catalogues with GROBID
Application
Web app and API of the Katabase/MSS project.
Language:PythonGPL-3.0000
New_OutputData
Encoded TEI-XML catalogues
GPL-3.0000
Language:Python000
OCRcat
Data por OCR
Language:Shell000
Language:Python000
visualisations
visualisations produites à partir du json créé en fin d'étape 4 (4_TaggedData)
Language:PythonGPL-3.0000
1_OutputData
Digitsed catalogues
Language:PythonGPL-3.0000
2_CleanedData
Cleaned catalogues.
Language:PythonGPL-3.0000
4_TaggedData
Tagged catalogues.
Language:PythonGPL-3.0000
Data_extraction
This repository contains everything we need for the data extraction.
Language:HTML000
GROBID_typo
Training data with the typographical information for GROBID-Dictionaries
Language:CSS000