lang-uk / ner-uk

Ukrainian NER annotation project

Prepare scripts to convert the existing corpus to formats suitable for training other models

dchaplinsky opened this issue

Once #11 is done, we'll have a shortlist of models to train and use.
Please prepare a set of scripts that convert the existing corpus from the BRAT format to the formats those models require.

Ideally, it should be a Python CLI script that accepts an output path and the target format.
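A minimal sketch of what such a script could look like, assuming the corpus is a directory of paired `.txt`/`.ann` files in BRAT standoff format. The function names, the regex tokenizer, and the two output formats (`conll` and `json`) are placeholders chosen for illustration; they are not the formats the shortlisted models actually expect, and discontinuous BRAT spans are simply skipped.

```python
import argparse
import json
import re
from pathlib import Path


def parse_ann(ann_text):
    """Return sorted (start, end, label) spans from BRAT text-bound annotations."""
    spans = []
    for line in ann_text.splitlines():
        if not line.startswith("T"):
            continue  # skip relations, events, notes, etc.
        parts = line.split("\t")
        if len(parts) < 2 or ";" in parts[1]:
            continue  # assumption: discontinuous spans are ignored in this sketch
        label, start, end = parts[1].split(" ")
        spans.append((int(start), int(end), label))
    return sorted(spans)


def to_bio(text, spans):
    """Tokenize with a naive regex (an assumption) and assign BIO tags."""
    rows = []
    for m in re.finditer(r"\w+|[^\w\s]", text):
        tag = "O"
        for start, end, label in spans:
            if m.start() >= start and m.end() <= end:
                tag = ("B-" if m.start() == start else "I-") + label
                break
        rows.append((m.group(), tag))
    return rows


def write_conll(docs):
    """One 'token<TAB>tag' per line, blank line between documents."""
    return "\n\n".join("\n".join(f"{t}\t{g}" for t, g in doc) for doc in docs) + "\n"


def write_json(docs):
    """Hypothetical JSON layout: a list of documents, each a list of tokens."""
    data = [[{"text": t, "ner": g} for t, g in doc] for doc in docs]
    return json.dumps(data, ensure_ascii=False, indent=2)


# Placeholder format names; real targets would be added here.
WRITERS = {"conll": write_conll, "json": write_json}


def main(argv=None):
    parser = argparse.ArgumentParser(description="Convert a BRAT corpus.")
    parser.add_argument("input_dir", type=Path, help="directory with .txt/.ann pairs")
    parser.add_argument("output", type=Path, help="output file path")
    parser.add_argument("--format", choices=sorted(WRITERS), default="conll")
    args = parser.parse_args(argv)

    docs = []
    for ann_path in sorted(args.input_dir.glob("*.ann")):
        text = ann_path.with_suffix(".txt").read_text(encoding="utf-8")
        spans = parse_ann(ann_path.read_text(encoding="utf-8"))
        docs.append(to_bio(text, spans))
    args.output.write_text(WRITERS[args.format](docs), encoding="utf-8")


if __name__ == "__main__":
    main()
```

Saved under a hypothetical name such as `convert.py`, it would be invoked as `python convert.py data/ out.conll --format conll`. Keeping the writers in a dictionary means a new target format only needs one more function and one more entry.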

Done for MITIE and Stanza.