AlexPoint / OpenNlp

Open source NLP tools (sentence splitter, tokenizer, chunker, coref, NER, parse trees, etc.) in C#

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

How to train own model for NER?

UNICASTERSOL opened this issue · comments

Anyone knows how to train own NER model?

Sorry, the question should be:-
Anyone knows how to train own NER model? I have tried to train my own but I have reached to the stage whereby it run out memory. Anyone know what should I configure for optimum for CUTOFF and Iteration please?

Did you got any solution @UNICASTERSOL ? I'm trying to get this working as well

I would love to know more. Did you receive any information?

commented

I haven't trained any NER model so far.
A good start would be to have a look at the original java project documentation.
From what I see in the code, you have to use the MaximumEntropyNameFinder class, and more specifically the TrainModel method. The input is a training file which should have a specific tagging with examples of the entities to detect (see NameFinderEventReader class to see how this training file is used).
Let me know if you manage to train your own model.