axa-group / nlp.js

An NLP library for building bots, with entity extraction, sentiment analysis, automatic language identification, and more

What is the schema for corpus.json file and possible entities?

rmtuckerphx opened this issue

Issue Template

Summary

Looking at this sample corpus.json file (https://github.com/axa-group/nlp.js/blob/master/examples/14-ner-corpus/corpus.json), I see that each entry in the entities section can be either an enum of values (hero) or a regex (email).

What other entity types are supported?
Is there a published schema for corpus.json?
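
To make the question concrete, here is a rough sketch of the two shapes I mean. It is not an official schema: the `options` key, the regex-as-string form, and the sample values are my assumptions based on the linked example, and `entityKind` and `parseRegexString` are hypothetical helpers that are not part of nlp.js.

```javascript
// Hypothetical "entities" section showing the two shapes described above.
// Key names and values are assumptions, not a published nlp.js schema.
const entities = {
  // Enum-style entity: each option maps to the texts that should match it.
  hero: {
    options: {
      spiderman: ['spiderman', 'spider-man'],
      ironman: ['ironman', 'iron-man'],
    },
  },
  // Regex-style entity: a "/pattern/flags" string (simplified illustrative pattern).
  email: '/\\b\\w+@\\w+\\.\\w{2,3}\\b/gi',
};

// Classify an entity definition by its shape (illustrative helper only).
function entityKind(definition) {
  if (typeof definition === 'string') return 'regex';
  if (definition && typeof definition === 'object' && definition.options) return 'enum';
  return 'unknown';
}

// Convert a "/pattern/flags" string into a RegExp (illustrative helper only).
function parseRegexString(text) {
  const lastSlash = text.lastIndexOf('/');
  return new RegExp(text.slice(1, lastSlash), text.slice(lastSlash + 1));
}

// Build a map of entity name to detected kind.
const kinds = Object.fromEntries(
  Object.entries(entities).map(([name, def]) => [name, entityKind(def)])
);
console.log(kinds);
```

Any entity shape beyond these two would fall into the `unknown` branch here, which is exactly the part I'd like to see documented.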

Your Environment

Software Version
nlp.js 4.26.1
node 16.16.0
npm 8.13.2
Operating System Windows 11

You can find the structure of the JSON schemas for this here, and all the entity types that can be defined here.

Hope this helps @rmtuckerphx

P.S. I'm not an official contributor or maintainer, but please close this issue when you get a chance (assuming I answered your question). Cheers 🥂