There are 6 repositories under hatespeech topic.
Trained models & code to predict toxic comments on all 3 Jigsaw Toxic Comment Challenges. Built using ⚡ Pytorch Lightning and 🤗 Transformers. For access to our API, please email us at contact@unitary.ai.
Repository for the paper "Automated Hate Speech Detection and the Problem of Offensive Language", ICWSM 2017
[DEPRECATED] A browser extension to block likers, retweeters, list members and Twitter ads and share your block lists with others. - say NO to hate speech!
Can we use explanations to improve hate speech models? Our paper accepted at AAAI 2021 tries to explore that question.
Deep Learning models to detect hate speech in tweets
A minimum-dependency ECMAScript client library and CLI tool for Parler – a "free speech" social network that accepts real money to buy "influence" points to boost organic non-advertising content
Python code to detect hate speech and classify twitter texts using NLP techniques and Machine Learning
This repository contains papers and resources pertaining to Hate speech research.
This repository contains Korean Hate Speech dataset for paper, "K-MHaS: A Multi-label Hate Speech Detection Dataset in Korean Online News Comment", accepted by COLING2022.
Resources and tools for the Tutorial - "Hate speech detection, mitigation and beyond" presented at ICWSM 2021
This is a python project that is used to identify hate speech in tweets. The dataset used to train the model is available on Kaggle and consists of labelled tweets where 1 indicates hate speech tweets and 0 indicates non-hate speech tweets.
Repository for the paper "Thou shalt not hate: Countering Online Hate Speech" accepted at ICWSM 2019.
NLP model that uses Machine Learning to detect offensive tweets, and classify it's target.
Can fear be used for polarisation and spreading negativity? Our paper accepted in The Web conference 2021 tries to explore this question in light of public Whatsapp groups.
Turkish and English Dataset from "Large-Scale Hate Speech Detection with Cross-Domain Transfer"
Testing and training detection models for emoji-based hate speech.
Contains code for a voting classifier that is part of an ensemble learning model for tweet classification (which includes an LSTM, a bayesian model and a proximity model) and a system for weighted voting
A Python package to compute HONEST, a score to measure hurtful sentence completions in language models. Published at NAACL 2021.
Trained Neural Networks (LSTM, HybridCNN/LSTM, PyramidCNN, Transformers, etc.) & comparison for the task of Hate Speech Detection on the OLID Dataset (Tweets).
KAREN: Unifying Hatespeech Detection and Benchmarking
This repository contains the resources used for presentation/discussion in weekly iRE Lab meetings.
Code for replicating results of team 'hateminers' at EVALITA-2018 for AMI task
SemEval 2022 Task 5: Multimedia Automatic Misogyny Identification - baseline models and dataset
A nlp framework to find hate speech comments out of a comments corpus.
CounterGeDi is a pipeline that aims at controlling the counter speech generated to make it emotional, polite and detoxified. Paper accepted at IJCAI 2022.
🔒 A bot to log and prevent hate speech across many servers on Discord
Author Profiling for Abuse Detection (COLING 2018)
A contextual approach for detecting hate speech code words
Hierarchical Multi Label Hate Speech and Abusive Language Classification
Multilingual Offensive Lexicon consists of the first contextual lexicon for abusive language detection, which is composed of 1,000 explicit and implicit terms and expressions with any pejorative connotation annotated with contextual information
A hate speech data set constructed using IR pooling technique to enhance diversity