fostroll / rucor_to_conllu

RuCor corpus to CoNLL-U format conversion

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

rucor_to_conllu: RuCor corpus to CoNLL-U format conversion

This notebook provides a pipeline to convert RuCor corpus to CoNLL-U format.

Prerequisites

  1. Toxine preprocessor.
  2. Opional: Morphological parser MorDL with UPOS and FEATS models already trained.

Usage

All pipeline is contained rucor_to_conllu.ipynb notebook file. Just set the paths to the RuCor and MorDL models and run. Maybe, you'll want also to change some variables in cells 2 or/and 4.

License

rucor_to_conllu is released under the Creative Commons License. See the LICENSE file for more details.

About

RuCor corpus to CoNLL-U format conversion

License:Creative Commons Zero v1.0 Universal


Languages

Language:Jupyter Notebook 100.0%