novakat's repositories
NYTK-NerKor-Cars-OntoNotesPP
A 1M+-token Hungarian named entity dataset with ~30 entity types derived from NYTK-NerKor
boilerplateResults
Results of boilerplate removal algorithms
Language: Python