spencermountain / compromise

modest natural-language processing

Home Page:http://compromise.cool

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Issues with .people() - "Bill Gates" - missed.

MarketingPip opened this issue · comments

Just doing some playing -

Melinda and Bill Gates

Melinda is parsed. Bill Gates is not!

Tho this might be a tough rule to parse being "gates" is a noun. Maybe just set a hard coded rule-set like this -

(#FirstName:uppercase|#MaleName:uppercase|#FemaleName:uppercase) #Noun:uppercase)

could catch some names.

As well it seems to be missing a lot of tags (tho we need to work on the first / last name DB being improved).

Tags returned for "Bill" currently are:

[
        "Noun",
        "Singular"
]
["switch": "Person|Noun"]

Tags returned should be:

[
        "Noun",
        "Singular",
        "Person",
        "FirstName",
        "ProperNoun",
        "MaleName"
      ]
["switch": "Person|Noun"]

@spencermountain - closing this as I seen it has been added with that entity list a few weeks ago! I will be improving the name list tho to include some names that are common with various races.