SLAPaper / Word-Cooccurence

Tools to find word co-occurrence in given sentences.

Word Co-occurrence

Tools to find word co-occurrence in given sentences.

Format definition

Text input

The input text file is a tab-separated file (TSV) with one record per line:

ID \t sentence

NOTICE: all spaces are ignored.
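As a rough illustration of the input format, the sketch below reads `ID \t sentence` records from a text stream. The function name `read_sentences` is hypothetical and not part of this repository. The sketch also assumes that "spaces are ignored" means whitespace around each field is trimmed; the tool itself may treat spaces differently.

```python
import io


def read_sentences(handle):
    """Yield (ID, sentence) pairs from a tab-separated text stream.

    Assumption: whitespace around each field is trimmed, and blank
    lines are skipped.
    """
    for line in handle:
        line = line.rstrip("\n")
        if not line.strip():
            continue  # skip empty lines
        # Split on the first tab only, so the sentence may contain tabs.
        sent_id, _, sentence = line.partition("\t")
        yield sent_id.strip(), sentence.strip()


# Hypothetical sample data standing in for a real input file.
sample = io.StringIO("1 \t Alice met Bob\n2\tBob met Carol\n")
rows = list(read_sentences(sample))
```

For a real file, the same function could be given `open(path, encoding="utf-8")` instead of the in-memory sample.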

Keyword(entity) input

The keyword input should be a JSON file of objects (dictionaries), one per keyword:

{
    "keyword1": {
        "abbreviation": [
            "key1_abbr1",
            "key1_abbr2"
        ],
        "full_name": [
            "key1_form1",
            "key1_form2"
        ]
    },
    "keyword2": {
        "full_name": [
            "key2_form1"
        ],
        "regular_expression": [
            "key2_re1"
        ]
    }
}
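One possible way to use such a keyword file is sketched below. It assumes that `abbreviation` and `full_name` entries are matched as literal, case-sensitive substrings and that `regular_expression` entries are Python `re` patterns; the repository may match differently. The names `compile_keywords` and `find_entities`, and the sample keywords, are hypothetical.

```python
import json
import re

# Hypothetical keyword definition using the same fields as the format above.
KEYWORDS_JSON = """
{
    "python": {
        "abbreviation": ["py"],
        "full_name": ["Python"]
    },
    "version": {
        "regular_expression": ["v[0-9]+"]
    }
}
"""


def compile_keywords(spec):
    """Map each keyword to a list of (form, compiled pattern) pairs."""
    patterns = {}
    for keyword, forms in spec.items():
        entries = []
        # Assumption: abbreviations and full names are literal strings.
        for field in ("abbreviation", "full_name"):
            for form in forms.get(field, []):
                entries.append((form, re.compile(re.escape(form))))
        # Assumption: regular expressions use Python's `re` syntax.
        for expr in forms.get("regular_expression", []):
            entries.append((expr, re.compile(expr)))
        patterns[keyword] = entries
    return patterns


def find_entities(sentence, patterns):
    """Return (keyword, matched text) for every form found in the sentence."""
    hits = []
    for keyword, entries in patterns.items():
        for _form, regex in entries:
            match = regex.search(sentence)
            if match:
                hits.append((keyword, match.group(0)))
    return hits


patterns = compile_keywords(json.loads(KEYWORDS_JSON))
hits = find_entities("Python v3 is out", patterns)
```

Because matching here is case-sensitive, the abbreviation `py` does not match inside `Python`; only the full name and the regular expression produce hits.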

ID-Entity Output

A tab-separated file (TSV):

NOTICE: all spaces are ignored.

ID \t keyword \t key_form
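A minimal sketch of producing rows in this layout follows. The helper `format_id_entity_rows` and its inputs are hypothetical; it simply joins the three fields with tabs, one line per matched form, and is not taken from the repository's code.

```python
def format_id_entity_rows(sent_id, hits):
    """Build one 'ID<TAB>keyword<TAB>key_form' line per (keyword, key_form) hit."""
    return ["\t".join((sent_id, keyword, key_form)) for keyword, key_form in hits]


# Hypothetical matches for the sentence with ID "1".
lines = format_id_entity_rows("1", [("python", "Python"), ("version", "v3")])
```

Each returned string can be written to the output file followed by a newline.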

ID-Sentence-Entities Output

A tab-separated file (TSV):

NOTICE: all spaces are ignored.

ID \t sentence \t keyword1, keyword2, keyword3
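The sketch below shows how a row in this layout might be built, and how keyword pairs that appear in the same sentence could then be counted. It assumes the keyword list is joined with a comma and a space, and that a pair counts once per sentence regardless of order. The function names and sample rows are hypothetical.

```python
from collections import Counter
from itertools import combinations


def format_sentence_row(sent_id, sentence, keywords):
    """Build 'ID<TAB>sentence<TAB>keyword1, keyword2, ...'."""
    return "\t".join((sent_id, sentence, ", ".join(keywords)))


def count_cooccurrences(rows):
    """Count how many sentences contain each unordered pair of keywords."""
    counts = Counter()
    for _sent_id, _sentence, keywords in rows:
        # Sort and deduplicate so (a, b) and (b, a) share one key.
        for pair in combinations(sorted(set(keywords)), 2):
            counts[pair] += 1
    return counts


# Hypothetical rows: (ID, sentence, keywords found in that sentence).
rows = [
    ("1", "Alice met Bob", ["alice", "bob"]),
    ("2", "Bob met Carol and Alice", ["bob", "carol", "alice"]),
]
line = format_sentence_row(*rows[0])
pair_counts = count_cooccurrences(rows)
```

With these sample rows, the pair `("alice", "bob")` is counted twice and the other two pairs once each.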

About

License: MIT License


Languages

Language: Python 100.0%