sdam-au / digital_epigraphy

Inscriptions as data: digital epigraphy in macro-historical perspective

Epigraphic production in Roman provinces in time (EDH)

Authors

  • Petra Hermankova , Social Dynamics in the Ancient Mediterranean project, Aarhus University, petra.hermankova@cas.au.dk
  • Vojtech Kase , Social Dynamics in the Ancient Mediterranean project, Aarhus University
  • Adela Sobotkova , Social Dynamics in the Ancient Mediterranean project, Aarhus University

Abstract

As short texts written on a durable medium, inscriptions offer invaluable insight into past societies, their organization, cultural norms and practices. Several hundred thousand inscriptions in Greek and Latin have survived to the present, providing a line of evidence on the populations of large cities and rural communities across the entire Mediterranean Basin between the eighth century BC and the eighth century AD. Although published inscriptions have been almost completely digitized and are available in online databases, and open computational tools exist to handle large datasets, large-scale and comparative studies of inscriptions are still rare. Numerous technical and conceptual issues, such as the inconsistent resolution of spatial and temporal attributes or the incompatibility of data structures between datasets, hinder the aggregation and analysis of thousands of inscriptions. The incomplete, uncertain and complex nature of inscriptions as a historical source required us to develop a series of custom open-source tools and reproducible pipelines that enable a macro-scale overview of epigraphic production in time and space. To illustrate the potential of quantitative studies in epigraphy, we harvest and render comparable two well-established, yet very distinct, digital collections for Latin epigraphy: Epigraphic Database Heidelberg (EDH), containing over 81,000 records, and Epigraphische Datenbank Clauss-Slaby (EDCS), with over 500,000 records. Placing the datasets side by side, we contrast past interpretations of the epigraphic habit, which were based on limited samples, with trends derived from all available data, and we discuss the strengths and shortcomings of each dataset. We assert that research communities stand to gain from extending digital infrastructures with packages of open and reusable research tools that reduce barriers to access.

Keywords

digital epigraphy; quantitative research; FAIR science; reproducibility; digital history; Latin inscriptions; epigraphic production

License

CC-BY-SA 4.0, see attached License.md

How to cite this resource (DOI, etc.)

TBA


Data

  1. The Epigraphic Database Heidelberg (EDH) dataset is accessed and transformed by a series of Python and R scripts in the EDH ETL repository and the EDH exploration repository, both created by the SDAM project. The latest version of the dataset (as a JSON file) can be accessed via Sciencedata.dk or at this link: EDH_public folder.
  2. The Epigraphic Database Clauss-Slaby (EDCS) dataset is accessed and transformed by a series of Python and R scripts in the EDCS ETL repository, created by the SDAM project. The latest version of the dataset (as a JSON file) can be accessed via Sciencedata.dk or at this link: EDCS_public folder.
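If a copy of one of these JSON files has been downloaded to disk, it can be inspected with the Python standard library alone. The sketch below is illustrative and not part of the SDAM pipelines. The helper name `load_records` is hypothetical, and the exact layout of the published files is an assumption: the function accepts either a list of record objects or a column-oriented object (column name mapped to row label mapped to value), and the real files should be checked before relying on either.

```python
import json
from pathlib import Path


def load_records(path):
    """Load a JSON export and return a list of per-inscription dicts.

    Hypothetical helper. Two layouts are handled (both are assumptions
    about how the file might be serialised):
      * row-oriented:    [{"col": value, ...}, ...]
      * column-oriented: {"col": {"row_label": value, ...}, ...}
    """
    with Path(path).open(encoding="utf-8") as fh:
        data = json.load(fh)

    if isinstance(data, list):
        return data

    if isinstance(data, dict):
        columns = list(data)
        if not all(isinstance(data[col], dict) for col in columns):
            raise ValueError("unrecognised column-oriented layout")
        # Row labels are taken from the first column, in file order.
        row_labels = list(data[columns[0]]) if columns else []
        return [
            {col: data[col].get(label) for col in columns}
            for label in row_labels
        ]

    raise ValueError("expected a JSON list or object at the top level")
```

Calling `len(load_records("EDH_text_cleaned_2021-01-21.json"))` on a local copy would then give the number of records, provided the file matches one of the two assumed layouts.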

Data Access

Access with R (using custom sdam package)

library(sdam)  # load the custom sdam package, which provides request()
resp = request("EDH_text_cleaned_2021-01-21.json", path="/sharingin/648597@au.dk/SDAM_root/SDAM_data/EDH/public", method="GET")

Access with Python (using custom SDDK package)

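# run in a Jupyter notebook; outside a notebook, install with "pip install sddk" from a shell
!pip install sddk
import sddk
# configure access to the shared folder "SDAM_root"; "648597@au.dk" is the owner of the shared folder
auth = sddk.configure("SDAM_root", "648597@au.dk")
# read the JSON file as a dataframe ("df")
EDH_utf8 = sddk.read_file("public/b6b6afdb969d378b70929e86e58ad975/EDH_text_cleaned_2021-01-21.json", "df", auth)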
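
Once the records are available as a list of dictionaries, however they were obtained, a first look at epigraphic production over time could tally inscriptions per century while respecting their dating uncertainty. The sketch below is one simple way to do that and is not necessarily the method used in the project notebooks: each inscription contributes a total weight of 1, spread evenly over the years of its dating interval. The function name is hypothetical, and the field names `not_before` and `not_after` are assumptions that should be checked against the actual dataset. Years are treated as plain integers (negative for BC), which ignores the absence of a year 0 in historical reckoning.

```python
from collections import defaultdict


def production_per_century(records, start_key="not_before", end_key="not_after"):
    """Sum inscriptions per century, spreading each one over its date range.

    Hypothetical helper; `start_key` and `end_key` name the assumed
    fields holding the earliest and latest possible year of a record.
    Returns {century_start_year: weighted_count}, sorted by century.
    """
    totals = defaultdict(float)
    for rec in records:
        start, end = rec.get(start_key), rec.get(end_key)
        start = None if start in (None, "") else int(start)
        end = None if end in (None, "") else int(end)
        if start is None and end is None:
            continue  # undated record: left out of the tally
        if start is None:
            start = end  # only one bound known: treat as a point date
        if end is None:
            end = start
        if end < start:
            start, end = end, start  # tolerate swapped bounds
        weight = 1.0 / (end - start + 1)  # equal share for every year
        for year in range(start, end + 1):
            # Floor division also bins negative (BC) years: -50 -> -100.
            totals[(year // 100) * 100] += weight
    return dict(sorted(totals.items()))
```

A record dated to 101-200 thus adds most of its weight to the bin starting at 100, while a record dated to a single year adds a full count to one bin. Other ways of modelling dating uncertainty, such as repeated random sampling of a date within each interval, would give a comparable picture and may suit some questions better.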
