Petr Plechac's repositories
rhymetagger
A simple collocation-driven recognition of rhymes. Contains pre-trained models for Czech, Dutch, English, French, German, Russian, and Spanish poetry
corpusCzechVerse
This repo contains 1305 books of poetry from the Corpus of Czech Verse. Annotated poetic meters, rhymes, tokenized, lemmatized, POS-tagged.
phoebeConverter
Converter from PhoEBE (phonetic notation used in Corpus of Czech Verse) to IPA, X-SAMPA, and Czech Phonetic Transcription.
stichometry
Stylometric analysis of poetic texts based on their versification
metrique-en-ligne
Métrique en Ligne Corpus (French)
poetry-corpus
Corpus of Hungarian poems in TEI XML with machine annotation
versification_authorship
Data and replication code for P. Plecháč (2021). Versification and Authorship Attribution
NHB-2018-OEstylometry
Replication code for Neidorf et al., "Large-scale quantitative profiling of the Old English verse tradition," Nature Human Behaviour 2019
poetree_deduplication
Scripts performing deduplication of PoeTree corpora
poetry-emotion
Poetry Corpora Annotated on Aesthetic Emotions
stylometry_tutorials
A set of interactive webpages illustrating some elements of stylometry
tvt2020
Data a kód k prezentaci z Týdne vědy a techniky 2020