There are 2 repositories under hate-speech topic.
Repository for the paper "Automated Hate Speech Detection and the Problem of Offensive Language", ICWSM 2017
Korean HateSpeech Dataset
Can we use explanations to improve hate speech models? Our paper accepted at AAAI 2021 tries to explore that question.
DeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets.
Code for the paper "Characterizing and Detecting Hateful Users on Twitter"
A Natural Portuguese Language Benchmark (Napolab) for the evaluation of language models.
This repository contains papers and resources pertaining to Hate speech research.
Resources and tools for the Tutorial - "Hate speech detection, mitigation and beyond" presented at ICWSM 2021
Data and code from our stories, "Google Has a Secret Blocklist that Hides YouTube Hate Videos from Advertisers—But It’s Full of Holes," and "Google Blocks Advertisers from Targeting Black Lives Matter YouTube Videos."
Capstone project to automate Twitter hate speech detection with classification modeling.
Can fear be used for polarisation and spreading negativity? Our paper accepted in The Web conference 2021 tries to explore this question in light of public Whatsapp groups.
iVerify Apps: Apps that support the AI-powered iVerify platform to combat misinformation and hate speech
Trained Neural Networks (LSTM, HybridCNN/LSTM, PyramidCNN, Transformers, etc.) & comparison for the task of Hate Speech Detection on the OLID Dataset (Tweets).
A Python package to compute HONEST, a score to measure hurtful sentence completions in language models. Published at NAACL 2021.
Intersectional bias in hate speech and abusive language datasets
Code for replicating results of team 'hateminers' at EVALITA-2018 for AMI task
Understanding hateful subreddits through text clustering
Prototype to detect Spanish hate-speech against women online.
Dataset and code implementation for the paper "Decoding the Underlying Meaning of Multimodal Hateful Memes" (IJCAI'23).
Useful resources for hate speech detection in Spanish
Hierarchical Multi Label Hate Speech and Abusive Language Classification
Detecting hate speech using the spoken content of videos using Machine Learning
Resources on hate speech moderation through counter narratives, counter speech
Classifying hate speech with deep learning (honors thesis 2017-18)
Репозиторий python-пакета "toxicity". Выявление токсичного контента в русскоязычных текстах c помощью глубокого обучения.
PHARM (Preventing Hate Against Refugees and Migrants) is a European project funded by the European Union, within Rights, Equality and Citizenship program. The main goal of the project is to monitor and model hate speech against refugees and migrants in Greece, Italy and Spain in order to predict and combat hate crime.
Repository hosting dataset for the Shared Task on Aggression Identification during First Workshop on Trolling, Aggression and Cyberbullying (TRAC - 1) as COLING - 2018. Please visit the workshop website - https://sites.google.com/view/trac1/home - for more details
Comparação de algoritmos de aprendizado profundo na classificação de comentários contendo discurso de ódio na internet.
Models to detect hateful comments served with flask trained on Kaggle's Toxic Comment Classification Challenge dataset.
Knowledge-bound counter speech generation to challenge hate speech