dbpedia / fact-extractor

Fact Extraction from Wikipedia Text

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Prepare job data for the event

marfox opened this issue · comments

It should contain 500 sentences for each of the following frames.
Triggering LUs are aside: ranked ones as per this file in bold, while the others come from Kicktionary or FrameNet.

Frame LUs
Attività (Activity) andare, esordire, debuttare, giocare, rimanere
Partita (Match) affrontare, giocare, incontrare
Vittoria (Victory) battere, sconfiggere, vincere
Sconfitta (Defeat) crollare, perdere, piegarsi
Stato (State) rimanere
Trofeo (Finish_Competition) vincere

Starting with the ranked ones

andare seems to co-occur with giocare very often, we may skip it.

crollare, only 4 sentences in the corpus, we may skip it.

In general, unranked verbs have few occurrences in the corpus.
This suggests to pursue the data-driven way with ranked ones only.