nefujiangping / IE

About raw data

  • Some words in the papers are split across lines by typesetting, and these words appear in the raw data as prehalf-backhalf.
    • Open question: are these words handled during preprocessing?
    • If not, they will cause problems when looked up in the embedding table.
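
One way to handle the split words before the embedding lookup is sketched below. It is only an illustration: the function name `rejoin_split_words` and the toy `vocab` are hypothetical, and the rule (drop the hyphen only when the hyphenated form is missing from the vocabulary but the joined form is present) is an assumption, not something the dataset prescribes.

```python
def rejoin_split_words(tokens, vocab):
    """Merge 'prehalf-backhalf' tokens when only the joined form is in the vocabulary."""
    fixed = []
    for tok in tokens:
        # Only touch hyphenated tokens that the embedding table does not know.
        if "-" in tok and tok.lower() not in vocab:
            joined = tok.replace("-", "")
            if joined.lower() in vocab:
                fixed.append(joined)
                continue
        # Real hyphenated terms and unknown tokens are left unchanged.
        fixed.append(tok)
    return fixed


# Toy vocabulary standing in for the keys of the embedding table (assumption).
vocab = {"speech", "recognition", "state-of-the-art"}
tokens = ["speech", "recog-nition", "state-of-the-art", "foo-bar"]
print(rejoin_split_words(tokens, vocab))
# ['speech', 'recognition', 'state-of-the-art', 'foo-bar']
```

Tokens that stay out of vocabulary after this step (such as `foo-bar` above) still need the usual unknown-word handling.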

Some doubts

  • The entities that appear in predicted relations and coreference clusters are not guaranteed to be in the predicted entity list. The ground truth is different: every entity of a relation or coreference cluster is definitely in the ground-truth entity list.
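
The mismatch described above can be measured with a small check. This is a sketch under assumed data shapes: entities as `(start, end, label)` triples, relations as `(start1, end1, start2, end2, label)` tuples, and coreference clusters as lists of `(start, end)` spans. The function name `missing_argument_spans` is hypothetical.

```python
def missing_argument_spans(entities, relations, clusters):
    """Return spans used by relations or coref clusters that are absent from the entity list."""
    entity_spans = {(start, end) for start, end, _label in entities}
    used = set()
    for s1, e1, s2, e2, _label in relations:
        used.add((s1, e1))
        used.add((s2, e2))
    for cluster in clusters:
        used.update((start, end) for start, end in cluster)
    return sorted(used - entity_spans)


# Toy prediction: the relation refers to span (7, 8), which was never predicted as an entity.
pred_entities = [(0, 1, "Method"), (4, 5, "Task")]
pred_relations = [(0, 1, 4, 5, "Used-for"), (0, 1, 7, 8, "Compare")]
pred_clusters = [[(0, 1), (10, 10)]]
print(missing_argument_spans(pred_entities, pred_relations, pred_clusters))
# [(7, 8), (10, 10)]
```

On ground-truth annotations the same check should return an empty list, which makes it usable as a sanity test for the data loader.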

About annotation

  • Annotation guideline
    • pdf1: more annotation examples.
    • pdf2: fixes some typos.
  • Greedy: spans can be embedded in longer spans only if the shorter span is involved in a relation.
  • If generic phrases are not involved in a relation, do not tag them.
    • Open question: what counts as "involved in a relation"? See pdf1, Figure 1, sentence #3 (approach, tagged Generic).
    • This can be done with a post-delete operation.
  • Open question: some terms are difficult to recognize.
  • "Multi-...KGC" mentions that the dataset was annotated by experts.
  • When the expression following the phrase "such as" has three or more components:
    • all of them are involved in the Conjunction relation.
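
The post-delete operation for generic phrases mentioned above could look like the following sketch. It assumes the same hypothetical data shapes as elsewhere in these notes: entities as `(start, end, label)` and relations as `(start1, end1, start2, end2, label)`. The function name and the `"Generic"` label string are assumptions.

```python
def drop_unrelated_generic(entities, relations, generic_label="Generic"):
    """Remove Generic entities whose span is not an argument of any relation."""
    in_relation = set()
    for s1, e1, s2, e2, _label in relations:
        in_relation.add((s1, e1))
        in_relation.add((s2, e2))
    return [
        (start, end, label)
        for start, end, label in entities
        # Non-generic entities are always kept; generic ones need a relation.
        if label != generic_label or (start, end) in in_relation
    ]


entities = [(0, 0, "Generic"), (3, 4, "Method"), (9, 9, "Generic")]
relations = [(0, 0, 3, 4, "Hyponym-of")]
print(drop_unrelated_generic(entities, relations))
# [(0, 0, 'Generic'), (3, 4, 'Method')]
```

Running the filter after relation prediction, rather than during entity tagging, keeps the tagger simple and applies the guideline in one place.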

Beyond these 6 Relation & Entity Types

  • Entity Literature
  • Relation Evaluate-for in pdf1 Figure 3 sentence #6
  • Relation Proposed-by, Synonym-of

Tools

Final Entity & Relation Types

Entity                        Relation
1. Task                       1. Used-for
2. Method                     2. Feature-of
3. Evaluation Metric          3. Hyponym-of
4. Material                   4. Part-of
5. Other Scientific Terms     5. Compare
6. Generic                    6. Conjunction
7. Literature                 7. Evaluate-for
                              8. Proposed-by
                              9. Synonym-of
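
The type inventory listed above can be written down as label sets for a model or a data validator. This is a minimal sketch: the constant names, the id assignment (list order), and the helper `unknown_labels` are assumptions, and the label strings are copied from the list rather than from any dataset file.

```python
ENTITY_TYPES = (
    "Task", "Method", "Evaluation Metric", "Material",
    "Other Scientific Terms", "Generic", "Literature",
)
RELATION_TYPES = (
    "Used-for", "Feature-of", "Hyponym-of", "Part-of", "Compare",
    "Conjunction", "Evaluate-for", "Proposed-by", "Synonym-of",
)

# Integer ids follow the order of the lists (an arbitrary but stable choice).
ENTITY_TO_ID = {name: i for i, name in enumerate(ENTITY_TYPES)}
RELATION_TO_ID = {name: i for i, name in enumerate(RELATION_TYPES)}


def unknown_labels(entities, relations):
    """Return labels that are not part of the final type inventory."""
    bad = [label for *_span, label in entities if label not in ENTITY_TO_ID]
    bad += [label for *_spans, label in relations if label not in RELATION_TO_ID]
    return bad


print(len(ENTITY_TYPES), len(RELATION_TYPES))
# 7 9
print(unknown_labels([(0, 1, "Method"), (2, 2, "Tool")], [(0, 1, 2, 2, "Used-for")]))
# ['Tool']
```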

Entity Literature, Relation Proposed-by

[Figure: Literature, Proposed-by]

Relation Evaluate-for

[Figure: Evaluate-for]

Relation Synonym-of

[Figure: Synonym-of]
