What should I do with the data?

Question

What should I do with the data?

fairy-of-9 opened this issue 4 years ago · comments

I downloaded OntoNotes Release 5.0.

and I did e2e-coref's getting started.

I created directories (data/train,data/development,data/test)
and data(output of getting started) are located in directories like data/train/train.english.v4_gold_conll

Did I miss anything or do something wrong?

Thanks.

henry · Answer 1 · Wed Jun 24 2020 09:12:09 GMT+0800 (China Standard Time)

This article maybe helpful：https://zhuanlan.zhihu.com/p/121786025

Shadman Rohan · Answer 2 · Wed Dec 09 2020 13:56:28 GMT+0800 (China Standard Time)

@fairy-of-9 did you manage to train?

fairy-of-9 · Answer 3 · Thu Dec 10 2020 10:35:18 GMT+0800 (China Standard Time)

@ShadmanRohan sry. I couldn't train.

Marur Srikanta · Answer 4 · Wed Aug 11 2021 17:54:50 GMT+0800 (China Standard Time)

Hi Shayne,
Congratulations on the great work with Coreference Resolution model.

Unfortunately, I do not have Ontonotes dataset and am using my .txt file. I am unable to find any useful link to convert .txt file into conll 2012 format. I tried using conll u format for training but did not succeed. It would be great if you can answer the following questions:

1.) Which tool can be used to annotate the text to match the coreferences
2.) Can your packages handle custom training

Please revert at a convenient time of yours.

Thanks and Regards
Marur Srikanta