WordBias: An Interactive Visual Tool for Discovering Intersectional Biases Encoded in Word Embeddings

Read paper (including Supplementary material) PDF
Video Presentation (5min) https://www.youtube.com/watch?v=LcwlyU3QT0w
Live DEMO http://130.245.128.219:6999/

Paper accepted at ACM SIGCHI 2021 Late Breaking Work

The above picture shows the visual interface of WordBias. The image can be broken into 3 parts:
(A) The Control Panel provides options to select words to be projected on the parallel coordinates plot
(B) The Main View shows the bias scores of selected words (blue lines) along different bias types (axes)
(C) The Search Panel enables users to search for a word and display the search/brushing results.

In the above figure, the user has brushed over 'Male' and 'Islam' subgroups. Words with strong association to both these subgroups are listed below the search box like bomb, terror, aggression, etc. This suggests that Word2vec embedding contains biases against Muslim males.

For a quick starter on Parallel Coordinates, please refer to this link.

Video Teaser

wordbias_preview.mp4

Overview

WordBias is an interactive visual tool designed to explore biases against intersectional groups like black females, black muslim males, etc. encoded in word embeddings. Our tool considers a word to be associated with an intersectional group say ‘Christian Males’ if it associates strongly with each of its constituting subgroups (Christians and Males). Our tool aims to act as an effective auditing tool for experts, an educational tool for non-experts and enhance accessibility for domain experts.

Installation Instructions

Clone this repo
Install Dependencies like flask, gensim, py_thesaurus, etc.
Run python app.py
Browse localhost:6999

Citation

@inproceedings{ghai2021wordbias,
  title={WordBias: An Interactive Visual Tool for Discovering Intersectional Biases Encoded in Word Embeddings},
  author={Ghai, Bhavya and Hoque, Md Naimul and Mueller, Klaus},
  booktitle={Extended Abstracts of the 2021 CHI Conference on Human Factors in Computing Systems},
  pages={1--7},
  year={2021}
}

Feel free to email me for any questions, comments at bghai@cs.stonybrook.edu

About

WordBias: Visualizing Intersectional Social biases encoded in Word Embeddings

fairness fairness-ml visualization

Languages

Language:Jupyter Notebook 50.8%Language:JavaScript 42.5%Language:Python 2.7%Language:HTML 2.7%Language:CSS 1.2%Language:Shell 0.1%