This is the Github repo for CrowS-Pairs, a challenge dataset for measuring the degree to which U.S. stereotypical biases present in the masked language models (MLMs). The associated paper is to be published as part of The 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020).
The dataset along with its annotations is in crows_pairs_anonymized.csv. It consists of 1,508 examples covering nine types of biases: race/color, gender/gender identity, sexual orientation, religion, age, nationality, disability, physical appearance, and socioeconomic status.
Each example is a sentence pair, where the first sentence is always about a historically disadvantaged group in the United States and the second sentence is about a contrasting advantaged group. The first sentence can demonstrate or violate a stereotype. The other sentence is a minimal edit of the first sentence: The only words that change between them are those that identify the group. Each example has the following information:
sent_more
: The sentence which is more stereotypical.sent_less
: The sentence which is less stereotypical.stereo_antistereo
: The stereotypical direction of the pair. Astereo
direction denotes thatsent_more
is a sentence that demonstrates a stereotype of a historically disadvantaged group. Anantistereo
direction denotes thatsent_less
is a sentence that violates a stereotype of a historically disadvantaged group. In either case, the other sentence is a minimal edit describing a contrasting advantaged group.bias_type
: The type of biases present in the example.annotations
: The annotations of bias types from crowdworkers.anon_writer
: The anonymized id of the writer.anon_annotators
: The anonymized ids of the annotators.
Use Python 3 (we use Python 3.7) and install the required packages.
pip install -r requirements.txt
The code for measuring stereotypical biases in MLMs is available at metric.py. You can run the code using the following command:
python metric.py
--input_file data/crows_pairs_anonymized.csv
--lm_model [mlm_name]
--output_file [output_filename]
For mlm_name
, the code supports bert
, roberta
, and albert
.
The --output_file
will store the sentence scores for each example. It will create a new CSV (or overwrite one with the same name) with columns sent_more, sent_less, stereo_antistereo, bias_type
taken from the input, and additional columns:
sent_more_score
: sentence score forsent_more
sent_less_score
: sentence score forsent_less
score
: binary score, indicating whether the model is biased towards the more stereotypical sentence (1
) or not (0
).
Please refer to the paper on how we calculate the sentence score.
Note that, if you use a newer version of transformers (3.x.x), you might obtain different scores than the one reported in our paper.
CrowS-Pairs is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. It is created using prompts taken from the ROCStories corpora and the fiction part of MNLI. Please refer to their papers for more details.
If you use CrowS-Pairs or our metric in your work, please cite it directly:
@inproceedings{nangia2020crows,
title = "{CrowS-Pairs: A Challenge Dataset for Measuring Social Biases in Masked Language Models}",
author = "Nangia, Nikita and
Vania, Clara and
Bhalerao, Rasika and
Bowman, Samuel R.",
booktitle = "Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing",
month = nov,
year = "2020",
address = "Online",
publisher = "Association for Computational Linguistics"
}