adlnlp / K-MHaS

This repository contains Korean Hate Speech dataset for paper, "K-MHaS: A Multi-label Hate Speech Detection Dataset in Korean Online News Comment", accepted by COLING2022.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Jean Lee, Taejun Lim, Heejun Lee, Bogeun Jo,
Yangsok Kim, Heegeun Yoon and Soyeon Caren Han.

Accepted by the 29th International Conference on Computational Linguistics
(COLING 2022).

Open In Colab

  • 26 Nov 2022 : HuggingFace update our dataset can be loaded through huggingface.

๐ŸŽฅ Paper Presentation : Please click on the image below.

Korean Multi-label Hate Speech Dataset

We introduce K-MHaS, a new multi-label dataset for hate speech detection that effectively handles Korean language patterns.

  • consisting of 109,692 utterances from Korean online news comments, labelled with 8 fine-grained hate speech classes.
  • providing (a)binary classification and (b)multi-label classification from 1(one) to 4(four) labels.
  • data collection period : between January 2018 and June 2020.

Multi-label Annotation

Compared with previous studies, the multi-label annotation scheme allows non-exclusive concepts, accounting for the overlapping shades of given categories.

Our annotation scheme has two layers:

  • (a) binary classification (Hate Speech or Not Hate Speech) and
  • (b) fine-grained classification.
    • Politics(์ •์น˜์„ฑํ–ฅ์ฐจ๋ณ„)
    • Origin(์ถœ์‹ ์ฐจ๋ณ„)
    • Physical(์™ธ๋ชจ์ฐจ๋ณ„)
    • Age(์—ฐ๋ น์ฐจ๋ณ„)
    • Gender(์„ฑ์ฐจ๋ณ„)
    • Religion(์ข…๊ต์ฐจ๋ณ„)
    • Race(์ธ์ข…์ฐจ๋ณ„)
    • Profanity(ํ˜์˜ค์š•์„ค)

For the fine-grained classification, a Hate Speech class from the binary classification, is broken down into eight classes, associated with the hate speech category. In order to reflect the social and historical context, we select the eight hate speech classes. For example, the politics class is chosen, due to a significant influence on the style of Korean hate speech.

  • Comparison of datasets

Dataset Details

In this repository, we provide splitted datasets that have 78,977(train) / 8,776 (validation) / 21,939 (test) samples, preserving the class proportion. The label numbers matching in both English and Korean is below.

label types 0 1 2 3 4 5 6 7 8
En Origin Physical Politics Profanity Age Gender Race Religion Not Hate Speech
Kr ์ถœ์‹ ์ฐจ๋ณ„ ์™ธ๋ชจ์ฐจ๋ณ„ ์ •์น˜์„ฑํ–ฅ์ฐจ๋ณ„ ํ˜์˜ค์š•์„ค ์—ฐ๋ น์ฐจ๋ณ„ ์„ฑ์ฐจ๋ณ„ ์ธ์ข…์ฐจ๋ณ„ ์ข…๊ต์ฐจ๋ณ„ ํ•ด๋‹น์‚ฌํ•ญ์—†์Œ
document	label
์ขŒ์ข€ ํ™์–ด๋“ค์ด ์ผ๋ฒ ๋ณด๋‹ค ๋” ๊ทนํ˜์ด๋‹ค	0,2
์™€ ์ด๊ฑฐ ์™„์ „ ์žฌ์•™์ด๋„ค. ๊นจ๋—ํ•œ ๋‚˜๋ผ ๋งŒ๋“ ๋‹ค๋ฉด์„œ ์™œ ์ด๋Ÿฐ ์‚ฌ๊ฑด์ด ํ„ฐ์ง€๋ƒ??	2
๋ชฌ์ƒ๊ธด ์—ฌ์ž์ธ๊ฐ„๋“ค์€ ์ฐŒ๊ทธ๋Ÿฌ์ ธ์žˆ์–ด๋ผ ํ™•๊ทธ๋ƒฅ ๋ง‰๊ทธ๋ƒฅ	1,5
์˜ค๋ฉด ํ™”์—ผ๋ณ‘ ๋˜์ง„๋‹ค ๋ผ์ง€์ƒˆํ‚ค์•ผ	1,3
์ˆ˜๊ผดํ‹€๋”ฑ์‹œํ‚ค๋“ค์ด ๋‹ค ๋””์ ธ์•ผ ๋‚˜๋ผ๊ฐ€ ๋˜‘๋ฐ”๋กœ ๋ ๊ฒƒ๊ฐ™๋‹ค..๋‹ต์ด ์—†๋Š” ์ข…์ž๋“คใ… 	2,3,4
ใ…‡ใ…‡? ์–‘๋†ˆ์ด๊ฑด ๊นœ์”จ๊ฑด ๋ญ” ์•„๋ฐฉ๊ฐ€๋ฅด๋“œํ•˜๋„ค	0,6
OOO์ฒœ์ง€ ๊ฐœ๋…๊ต ์ƒˆ๋ผ๋“ค์•„ ๋‹ค ์ฐฉํ•ด์ ธ๋ผ!	3,7
๋‹ค๋“ค ์˜ค์ง€๋ž– ์ข€...๋˜๊ฒŒ ํ•  ์ง“ ์—†์–ด๋ณด์—ฌ์š”	8

Experiments

We evaluate strong baseline experiments on our dataset using Korean-BERT-based language models with six different metrics (F1-[macro, micro, weighted], Exact Match, AUC and Hamming Loss).

  • MultiBERT (Wolf et al., 2020) : pre-trained on the Wikipedia in 104 different languages (110M parameters and 119K vocab.).
  • KoELECTRA (Park, 2020) : pre-trained on 34GB Korean news, Wikipedia, Namu (Korean wiki) and Modu (14M parameters and 35K vocab.).
  • KoBERT (SKTBrain, 2019) : pre-trained on 54M words from Korean Wikipedia (92M parameters and 8K vocab.).
  • KR-BERT (Lee et al., 2020) : pre-trained on 2.47GB corpus with 233M words from Korean Wikipedia and news, applying either
    • (1) the character-level tokenizer; or
    • (2) the sub-character-level tokenizer.

The overall performance for all labels

  • The micro F1-score range between 0.8139 (Multi-BERT) and 0.8500 (KR-BERT-c: using character-level tokenizer).
  • KoELECTRA obtains overall the best or second best among six metrics.
  • This indicates the effects of the pre-training data source, considering that the KoELECTRA corpora contains modern slang and buzzwords.


The breakdown performance for the multi-label classification

  • KR-BERT using sub-character-level tokenizer achieved the best, or second best performance.
  • The sub-character tokeniser can decompose Hangul(Korean language) syllable characters into sub-characters.
  • It provides a great ability to detect a hate speech word, composed by each character in different hate speech labels.

Contributors

The contributors of the work are:

  • Jean Lee (Ph.D. candidate at the University of Sydney)
  • Taejun Lim (Mphil. candidate at the University of Sydney)
  • Heejun Lee (BigWave AI)
  • Bogeun Jo (BigWave AI)
  • Yangsok Kim (Professor at Keimyung University)
  • Heegeun Yoon (National Information Society Agency)
  • Soyeon Caren Han (Senior lecturer at the University of Melbourne)

Citation

If you use the dataset, please cite this our paper K-MHaS: A Multi-label Hate Speech Detection Dataset in Korean Online News Comment accepted by COLING2022.

@inproceedings{lee-etal-2022-k,
    title = "K-{MH}a{S}: A Multi-label Hate Speech Detection Dataset in {K}orean Online News Comment",
    author = "Lee, Jean  and
      Lim, Taejun  and
      Lee, Heejun  and
      Jo, Bogeun  and
      Kim, Yangsok  and
      Yoon, Heegeun  and
      Han, Soyeon Caren",
    booktitle = "Proceedings of the 29th International Conference on Computational Linguistics",
    month = oct,
    year = "2022",
    address = "Gyeongju, Republic of Korea",
    publisher = "International Committee on Computational Linguistics",
    url = "https://aclanthology.org/2022.coling-1.311",
    pages = "3530--3538",
    abstract = "Online hate speech detection has become an important issue due to the growth of online content, but resources in languages other than English are extremely limited. We introduce K-MHaS, a new multi-label dataset for hate speech detection that effectively handles Korean language patterns. The dataset consists of 109k utterances from news comments and provides a multi-label classification using 1 to 4 labels, and handles subjectivity and intersectionality. We evaluate strong baselines on K-MHaS. KR-BERT with a sub-character tokenizer outperforms others, recognizing decomposed characters in each hate speech class.",
}

About

This repository contains Korean Hate Speech dataset for paper, "K-MHaS: A Multi-label Hate Speech Detection Dataset in Korean Online News Comment", accepted by COLING2022.


Languages

Language:Jupyter Notebook 100.0%