hate-speech

There are 36 repositories listed under the hate-speech topic.

unitaryai / detoxify
Trained models & code to predict toxic comments on all 3 Jigsaw Toxic Comment Challenges. Built using ⚡ PyTorch Lightning and 🤗 Transformers. For access to our API, please email us at contact@unitary.ai.
bert bert-model huggingface-transformers huggingface nlp toxic-comment-classification toxicity toxic-comments sentence-classification kaggle-competition pytorch-lightning hatespeech hate-speech-detection toxicity-classification hate-speech
Language:Python 848
t-davidson / hate-speech-and-offensive-language
Repository for the paper "Automated Hate Speech Detection and the Problem of Offensive Language", ICWSM 2017
hatespeech offensive nlp icwsm twitter abuse offensive-language hate-speech natural-language-processing dataset labeled-data classifier machine-learning computational-social-science
Language:Jupyter Notebook 755
kocohub / korean-hate-speech
Korean HateSpeech Dataset
dataset korean-nlp hate-speech natural-language-processing
366
Hironsan / HateSonar
Hate Speech Detection Library for Python.
hate-speech machine-learning natural-language-processing python
Language:Jupyter Notebook 179
hate-alert / HateXplain
Can we use explanations to improve hate speech models? Our paper accepted at AAAI 2021 tries to explore that question.
explainability hate-speech bias offensive detection hatespeech interpretable-deep-learning bert-model lstm attention-lstm bert-fine-tuning
Language:Python 177
surge-ai / toxicity
The world's largest social media toxicity dataset.
toxicity content-moderation hate-speech hate-speech-detection
171
hate-alert / DE-LIMIT
DeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets.
bert laser-embeddings classification cnn-gru hate-speech multilingual
Language:Jupyter Notebook 104
manoelhortaribeiro / HatefulUsersTwitter
Code for the paper "Characterizing and Detecting Hateful Users on Twitter"
abuse-detection hate-speech twitter
Language:Jupyter Notebook 71
ruanchaves / napolab
A Natural Portuguese Language Benchmark (Napolab) for the evaluation of language models.
benchmarks catalan datasets english galician hate-speech huggingface large-language-models nlp portuguese question-answering semantic-similarity spanish text-simplification textual-entailment transformers huggingface-transformers python
Language:Python 51
hate-alert / Hate-Speech-Reading-List
This repository contains papers and resources pertaining to Hate speech research.
reading-list hatespeech hate-speech counterspeech counter-speech research hate speech counter
41
hate-alert / Tutorial-Resources
Resources and tools for the Tutorial - "Hate speech detection, mitigation and beyond" presented at ICWSM 2021
hatespeech hate-speech-detection hate-speech counterspeech twitter tutorial icwsm2021 nlp natural-language-processing bert-model xlmroberta xlm-roberta huggingface-transformers huggingface abuse-detection
Language:Python 33
phusroyal / ViHOS
Repository for the paper "ViHOS: Vietnamese Hate and Offensive Spans Detection" (EACL2023)
dataset deep-learning hate-speech natural-language-processing social-media-mining nlp machine-learning python3 sequence-labeling span-detection span-prediction vietnamese-dataset vietnamese-nlp vihos benchmark benchmark-datasets
Language:Jupyter Notebook 31
the-markup / investigation-youtube-ad-placements
Data and code from our stories, "Google Has a Secret Blocklist that Hides YouTube Hate Videos from Advertisers—But It’s Full of Holes," and "Google Blocks Advertisers from Targeting Black Lives Matter YouTube Videos."
youtube keyword-lists hate hate-speech undocumented-endpoints algorithm-auditing racial-justice social-justice
Language:Jupyter Notebook 27
sidneykung / twitter_hate_speech_detection
Capstone project to automate Twitter hate speech detection with classification modeling.
classification hate-speech hate-speech-tweets logistic-regression nlp nlp-machine-learning twitter
Language:Jupyter Notebook 26
hate-alert / Fear-speech-analysis
Can fear be used for polarisation and spreading negativity? Our paper, accepted at The Web Conference 2021, explores this question in light of public WhatsApp groups.
fear-speech whatsapp hate-speech transformers survey facebook-ads hatespeech natural-language-processing paper fearspeech whatsapp-groups dataset
Language:Jupyter Notebook 24
undp / iVerify-Apps
iVerify Apps: Apps that support the AI-powered iVerify platform to combat misinformation and hate speech
misinformation disinformation hate-speech information-pollution elections
Language:TypeScript 21
richouzo / hate-speech-detection-survey
Trained Neural Networks (LSTM, HybridCNN/LSTM, PyramidCNN, Transformers, etc.) & comparison for the task of Hate Speech Detection on the OLID Dataset (Tweets).
nlp machine-learning deep-learning natural-language-processing twitter hatespeech offensive-language social-media hate-speech hate-speech-detection bert transformers huggingface captum xai
Language:Jupyter Notebook 20
MilaNLProc / honest
A Python package to compute HONEST, a score to measure hurtful sentence completions in language models. Published at NAACL 2021.
nlp embeddings transformer nlp-library nlp-machine-learning bert hatespeech hate-speech offensive-language toxicity natural-language-processing
Language:Python 19
jaeyk / intersectional-bias-in-ml
Intersectional bias in hate speech and abusive language datasets
abusive-language bias fairness hate-speech icwsm machine-learning twitter
Language:Jupyter Notebook 14
hate-alert / HateALERT-EVALITA
Code for replicating results of team 'hateminers' at EVALITA-2018 for AMI task
classification hate-speech misogyny nlp universal-sentence-encoder glove-embeddings tfidf hatespeech
Language:Jupyter Notebook 13
eigenfoo / reddit-clusters
Understanding hateful subreddits through text clustering
reddit hate-speech text-clustering nmf
Language:Python 11
violentometro-online-team / violentometro-online
Prototype to detect Spanish hate-speech against women online.
hate-speech women data-science streamlit heroku spanish-hate-speech
Language:Jupyter Notebook 11
AndreCNF / polids
Analysis of electoral manifestos, with the results presented through apps.
dashboard data-science data-visualization hate-speech natural-language-processing nlp politics sentiment-analysis
Language:Python 10
Social-AI-Studio / HatReD
Dataset and code implementation for the paper "Decoding the Underlying Meaning of Multimodal Hateful Memes" (IJCAI'23).
content-moderation hateful-memes hate-speech multimodal dataset
Language:Python 10
AfriHate / AfriHate
This is a repository for the AfriHate project.
hate-speech hateful hatespeech offensive offensive-language toxic
8
fmplaza / hate-speech-spanish-lexicons
Useful resources for hate speech detection in Spanish
lexicons spanish hate-speech-detection hate-speech
8
faizaladhitama / Hierarchical-Multi-Label-Classification-API
Hierarchical Multi Label Hate Speech and Abusive Language Classification
hatespeech multi-label multilabel classification multilabel-classification indonesian indonesian-hatespeech twitter text-classification text-classification-python indonesian-research abusive-language hate-speech-detection hate-speech ujaran-kebencian kata-kasar ujaran indonesia
Language:Python 7
unnathi10 / HateSpeechDetection
Detecting hate speech in the spoken content of videos using machine learning
machine-learning deep-learning google-speech-to-text hate-speech python rnn naive-bayes linear-svm random-forest roc-curve
Language:Jupyter Notebook 7
yilingchung / counternarrative-resources
Resources on hate speech moderation through counter narratives, counter speech
counter-narratives hate-speech nlp survey counterspeech
7
ciwang / deep_hatespeech
Classifying hate speech with deep learning (honors thesis 2017-18)
natural-language-processing computational-social-science hatespeech offensive-language hate-speech discrimination nlp machine-learning
Language:Jupyter Notebook 6
nsbarsukov / toxic-comments-detector
Repository of the Python package "toxicity". Detection of toxic content in Russian-language texts using deep learning.
deep-learning toxic-comment-classification toxic-comments-detector toxicity hate-speech
Language:Jupyter Notebook 6
thepharmproject / set_of_scripts
PHARM (Preventing Hate Against Refugees and Migrants) is a European project funded by the European Union within the Rights, Equality and Citizenship programme. The main goal of the project is to monitor and model hate speech against refugees and migrants in Greece, Italy and Spain in order to predict and combat hate crime.
hate-speech migrants refugees detection
Language:Python 6
kmi-linguistics / trac-1
Repository hosting the dataset for the Shared Task on Aggression Identification at the First Workshop on Trolling, Aggression and Cyberbullying (TRAC-1) at COLING 2018. Please visit the workshop website, https://sites.google.com/view/trac1/home, for more details.
aggression trac1 trolling cyberbullying cyberbullying-detection aggression-identification hate-speech abuse-detection abusive-language coling2018 coling-2018 social-media facebook social-network
5
rafaelgreca / TFG
Comparison of deep learning algorithms for classifying online comments containing hate speech.
sentiment-analysis analise-de-sentimentos discurso-de-odio hate-speech hate-speech-detection lstm rnc cnn deep-learning aprendizado-profundo python keras
Language:Jupyter Notebook 5
jpcorb20 / toxic-comment-server
Models to detect hateful comments, trained on Kaggle's Toxic Comment Classification Challenge dataset and served with Flask.
flask hate-speech huggingface-transformers kaggle-dataset sklearn torch
Language:Python 4
yilingchung / Towards_KN_CN_Generation
Knowledge-bound counter speech generation to challenge hate speech
counter-narratives counter-speech hate-speech nlg nlp
Language:Julia 4
