dadelani / nepali-ner

Nepali NER dataset and code

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

nepali-ner

Nepali NER dataset and code:

Annotation Guide:

Guide

More details

Short tutorial on building NER datasets and models for new languages

Tasks:

Task 1: Annotate NER dataset (~510 sentences) --- NAAMII NER

Task 2: Split annotated sentences into 100/100/410 train/dev/test split

Task 3: Train NER baseline model on the new dataset (NAAMII NER)

Task 4: Train NER models on CoNLL03, WikiANN (NE), WikiANN (HI), and Singh NER and evaluate on the new dataset (NAAMI NER)

Author

David Adelani for Third Nepal Winter School in AI

About

Nepali NER dataset and code

License:Apache License 2.0


Languages

Language:Jupyter Notebook 98.5%Language:Python 1.4%Language:Shell 0.1%