ololobus / slavic_text_scht

St. Petersburg corpus of hagiographic texts

St. Petersburg Corpus of Hagiographic Texts

Old Church Slavic corpus

http://project.phil.spbu.ru/scat/page.php?page=project

Parser

Run the parser on an XML file to get its entire text:

./tei_parser.py xml/Aleksandr_svirskij.xml
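
For readers who want to see the idea without opening the script, here is a minimal sketch of pulling all text out of an XML document with the standard library. It is not the code of `tei_parser.py`; the function name `extract_text` and the sample markup are assumptions made for illustration, and a real TEI file would also contribute its header text unless the search is narrowed to the body.

```python
import xml.etree.ElementTree as ET


def extract_text(xml_source: str) -> str:
    """Return all text of an XML document as one whitespace-normalized string.

    Hypothetical helper, not part of tei_parser.py.
    """
    root = ET.fromstring(xml_source)
    # itertext() yields every text fragment in document order, tags dropped.
    # Fragments are joined without a separator so that a word split by
    # inline markup is not broken apart.
    raw = "".join(root.itertext())
    # Collapse runs of spaces and newlines into single spaces.
    return " ".join(raw.split())


# Invented sample; the real corpus files are much richer.
sample = (
    "<TEI><text><body><p>Zhitie <name>Aleksandra</name>\n"
    "   Svirskago</p></body></text></TEI>"
)
text = extract_text(sample)
```

Joining fragments without a separator keeps words intact, at the cost of gluing together neighbouring elements that have no whitespace between them.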

TODO:

  • return text sentence by sentence
  • return text clause by clause
  • keep info about named entities (<name> tag)
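
One possible shape for these TODO items is sketched below. It assumes that the units are marked with the TEI elements `<s>` (sentence) and `<cl>` (clause); whether the corpus files actually carry such markup is not confirmed here. The function name `units_with_names` and the sample document are hypothetical.

```python
import xml.etree.ElementTree as ET


def local_name(tag: str) -> str:
    """Strip a '{namespace}' prefix from an element tag, if present."""
    return tag.rsplit("}", 1)[-1]


def normalized_text(elem) -> str:
    """All text under an element with whitespace collapsed."""
    return " ".join("".join(elem.itertext()).split())


def units_with_names(xml_source: str, unit_tag: str = "s"):
    """Return (text, names) pairs, one per unit element.

    unit_tag="s" gives sentences and unit_tag="cl" gives clauses,
    assuming the file marks them with those TEI elements.
    """
    root = ET.fromstring(xml_source)
    result = []
    for elem in root.iter():
        if local_name(elem.tag) != unit_tag:
            continue
        # Collect the named entities (<name> tags) inside this unit.
        names = [
            normalized_text(n)
            for n in elem.iter()
            if local_name(n.tag) == "name"
        ]
        result.append((normalized_text(elem), names))
    return result


# Invented sample document for illustration only.
sample = (
    "<text>"
    "<s>Zhitie <name>Aleksandra</name> Svirskago.</s>"
    "<s>Bez imen.</s>"
    "</text>"
)
units = units_with_names(sample)
```

Returning the names next to each unit keeps the entity information without leaving markup in the text itself.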

Languages

Language:Python 100.0%