This is the python procedures for preprocessing New York Times dataset which userd for distant supervision relation extraction. We also have a brief statistic on this dataset, we think it suffer from noisy and long-tail problems.
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool