Coptic SCRIPTORIUM's repositories
coptic-nlp
Coptic NLP pipeline page and utilities
CopticScriptorium.github.io
Website repository for copticscriptorium.org
converters
Converters to encode text into different Coptic text formats (UTF-8 character encoder, SGML processor, etc.)
tokenizers
Coptic SCRIPTORIUM Tokenization Script
tagger-part-of-speech
Part-of-speech tagger for Sahidic Coptic
dictionary
A dictionary comprising the Coptic lexicon created by the BBAW, with an interface by Coptic SCRIPTORIUM. Currently deployed at https://corpling.uis.georgetown.edu/coptic-dictionary/
entity-tagging
Coptic SCRIPTORIUM materials for entity tagging
normalizer
Normalizes orthography
shenoute-unplaced-dev
Documents attributed to Shenoute of Atripe for which the specific work is unknown or cannot be placed in a specific volume of Discourses or Canons
editions-public-domain
Published text editions that are in the public domain but not ready for public release by Coptic SCRIPTORIUM
misc-development
Miscellaneous issues and items under development. Park your stuff here if you don't know what else to do with it.
pelagios-dataset-summary
Dataset summary files for linking via Pelagios
embeddings
Continuous word representations for Coptic
ethercalc-tools
Tools for interfacing with EtherCalc
IACS2022
Materials for the tutorial session at the International Association of Coptic Studies 2022 Congress
lexical-taggers
Lexical taggers (language of origin, lemmatizer) for Sahidic Coptic
paths-longtexts-dev
Hagiographical and other longer Sahidic Coptic texts digitized by PATHS