machine-learning coreference-resolution huggingface allen-nlp

Measuring Gender Bias in Coreference Resolution Systems

Team

Student name	CCID
Ravika Nagpal	ravika
Natalie Hervieux	nhervieu

Execution Instructions

Run

Use the google colab notebooks provided in src folder to run the two models:

E2E model - provided by Allen NLP and based on the work of Lee, Kenton & He, Luheng & Lewis, Mike & Zettlemoyer, Luke. 2017. End-to-end Neural Coreference Resolution.
Neural Coref model - provided by Huggingface and based on the work of Kevin Clark and Christopher D. Manning. 2016. Deep reinforcement learning for mention-ranking coreference models. In Empirical Methods on Natural Language Processing.

Data

The data used can be found in data folder. The csv files are taken from the github repo of the paper Levy, Shahar & Lazar, Koren & Stanovsky, abriel. (2021). Collecting a Large-Scale Gender Bias Dataset for Coreference Resolution and Machine Translation.

Apart from the csv files, the data folder contains folder gold_stats which contain files showing our analysis on the BUG dataset which is also reported in the report.

Output

The output of the run done on gold_BUG dataset is stored in csv files corresponding to each model and can be found in the output directory.

About

Evaluate NeuralCoref and E2E models on BUG dataset

machine-learning coreference-resolution huggingface allen-nlp

Languages

Language:Jupyter Notebook 100.0%