zxlzr / IEDatasetZoo

Information Extraction Dataset Zoo.

IE Dataset Zoo

Information extraction dataset zoo.

Contributed by Ningyu Zhang, Shumin Deng.

Named Entity Recognition

| Dataset | #Type. | #Inst. | Feature | Source | Resource | Origin |
| --- | --- | --- | --- | --- | --- | --- |
| Few-NERD | 66 | 188,200 | Few-shot | Wikipedia+Wikidata | url | url |

Relation Extraction

Sentence-Level

| Dataset | #Rel. | #Inst. | Feature | Source | Resource | Origin |
| --- | --- | --- | --- | --- | --- | --- |
| FewRel | 100 | 44,800 | Supervised | Wikipedia+Wikidata | url | url |
| TACRED | 42 | 68,120 | Supervised | Newswire+web | - | url |
| SemEval | 19 | 8,000 | Supervised | Web | url | url |
| Wikidata | 352 | 495,883 | Distant supervision | Wikipedia+Wikidata | url | url |
| NYT10(tsinghua) | 53 | 522,043 | Distant supervision | NYT+Freebase | url | url |
| NYT10-large(tsinghua) | 53 | 570,088 | Distant supervision | NYT+Freebase | url | url |
| NYT-Wikidata | 100 | 882,177 | Distant supervision | NYT+Wikidata | url | url |
| NYT10-29 | 29 | 70,339 | Distant supervision | NYT+Freebase | url | url |
| NYT11-12 | 12 | 62,648 | DS+supervised | NYT+Freebase | url | url |
| NYT-manual | 24 | 235,982 | Distant supervision | NYT+Freebase | url | url |
| NYT-Wiki(zju) | 73 | 1,989,377 | Distant supervision | NYT-Wikipedia-Wikidata | url | url |
| Wiki-KBP | 19 | 23,784 | Distant supervision | Wikipedia+KBP+Freebase | url | url |
| PubMed-BioInfer | 94 | 1,580 | Distant supervision | PubMed+NESH | - | url |
| WebNLG | 14 | 75,325 | Supervised | Web | - | url |
| SKE | 50 | 173,108 | Supervised | Web | url | url |
| KBP37 | 37 | 15,916 | Supervised | Web | url | url |
| T-REx | 642 | 6.3M | Distant supervision | Wikipedia+Wikidata | - | url |
| Google-RE | 5 | 59,576 | Supervised | Wikipedia | - | url |
| ADE | 3 | 23,516 | Supervised | Medical Report | url | url |

Other Datasets

Document-Level

Event Extraction

| Dataset | #Inst. | Feature | Source | Resource | Origin |
| --- | --- | --- | --- | --- | --- |
| ACE05 | 599 | Supervised | Web | - | url |
| FewEvent(zju) | 71,385 | Supervised | ACE05+TAC-KBP17 | url | url |
| CCKS2019_Event | 17,815 | Supervised | Financial Announcements | url | url |
| Doc2EDAG | 32,040 | Supervised | Financial Announcements | url | url |

How to Cite
