malllabiisc / RESIDE

EMNLP 2018: RESIDE: Improving Distantly-Supervised Neural Relation Extraction using Side Information

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Could you plz release the id2entitytype dict corresponding to the provided riedel_preprocessed.pkl?

loginaway opened this issue · comments

Hi,

We are trying to do some comparison based on the RESIDE model, and it turns out that we need to align the entity pairs in your dataset with that in our version(due to a random permutation, the index are different but the total items are in total the same).

We wonder whether you could release the mapping from line index (e.g. 0, 1, 2, ...) in the test file in 'riedel_preprocessed.pkl' to the entity pairs id (e.g. ("m.0chrx", "m.01_d4"), ("m.0abcds", "m.01_-jfew"), ("m.0chrx", "m.01ojlll")).

Or alternatively, could you release the 'riedel_test_bags.json' and 'riedel_train_bags.json' files, so that we can generate the map on our own? We have tried running the preprocessing codes from scratch, but maybe due to randomness, we could not reproduce the result exactly.

Thank you!

Hi @loginaway,
I hope you are doing great! Thanks for your help in improving WordGCN code :)

Regarding this issue, I am sharing bags riedel_train_bags.json and link to riedel_test_bags.json is already there in the readme. Let me know if this resolves your problem.
https://drive.google.com/file/d/1Hoak1oxEHf4hsjTZ8oyPaslBgS0J-90s/view?usp=sharing

@svjan5
Thanks! (You still remember that!)

Do you release the correct version of riedel_train_bags.json on Riedel NYT dataset? I found that it is even smaller in size than the riedel_test_bags.json, and it has only 156,619 lines...

Hi @loginaway,
:) yeah, I remember your username.

I am sharing another file; hopefully, this is what you want. A long time has passed, so I have lost a bit of touch with this work. Let me know if it worked.
https://drive.google.com/open?id=1HdyekUcOBUfDQMW7hgXdJ5S1Rnap5yJS

Thanks, it works😊