MohamedHadjAmeur / AraCovid19-SSD

AraCovid19-SSD: Arabic COVID-19 Sentiment and Sarcasm Detection Dataset

How to get the dataset:

To get the full dataset, please contact the authors. For the quickest response, you can write to all of these addresses at once: mohamedhadjameur@gmail.com, ahassina@cerist.dz, drdhn@cerist.dz

Description:

AraCOVID19-SSD (arXiv:2110.01948) is a manually annotated multi-label Arabic COVID-19 sentiment and sarcasm detection dataset. The dataset contains 5,162 annotated tweets. The AraCOVID19-SSD labels, their values, and their meanings are provided in the table below:

Examples of the instances present in the dataset are provided in the table below:

Content:

Statistics on the number of tweets in each topic are provided in the table below:

Data Retrieval:

In accordance with Twitter’s Terms of Service, we provide only the tweet IDs.

Tools such as Twarc or Hydrator can be used to retrieve the tweets from their IDs. If you run into any problems, you can contact the authors at the email addresses listed in the Contacts section below.
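As a rough illustration of the retrieval step, the sketch below cleans a list of IDs and passes them to Twarc's `hydrate` method (twarc v1 client). It is not the authors' own script. The ID file path, the assumption of one ID per line, and the keys of the `credentials` dictionary are all hypothetical, and API access depends on the platform's current terms.

```python
from typing import Iterable, Iterator


def read_ids(lines: Iterable[str]) -> Iterator[str]:
    """Yield unique numeric IDs in order, skipping blanks and non-numeric lines."""
    seen = set()
    for line in lines:
        candidate = line.strip()
        # Skip empty lines, header rows, and IDs that were already yielded.
        if candidate.isdigit() and candidate not in seen:
            seen.add(candidate)
            yield candidate


def hydrate(id_file: str, credentials: dict) -> Iterator[dict]:
    """Yield tweet objects for the IDs in `id_file` (needs `pip install twarc`)."""
    # Imported lazily so that read_ids() can be used without twarc installed.
    from twarc import Twarc

    # The dictionary keys below are hypothetical names chosen for this sketch.
    client = Twarc(
        credentials["consumer_key"],
        credentials["consumer_secret"],
        credentials["access_token"],
        credentials["access_token_secret"],
    )
    with open(id_file, encoding="utf-8") as handle:
        # Twarc.hydrate takes an iterable of IDs and batches the requests itself.
        yield from client.hydrate(read_ids(handle))
```

Tweets that have been deleted or made private since collection are not returned, so the hydrated set may be smaller than the list of IDs.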

License:

The AraCOVID19-SSD dataset is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International license (CC BY-NC-SA 4.0).

Citations:

Please cite as:

@misc{ameur2021aracovid19ssd,
      title={AraCOVID19-SSD: Arabic COVID-19 Sentiment and Sarcasm Detection Dataset}, 
      author={Mohamed Seghir Hadj Ameur and Hassina Aliane},
      year={2021},
      eprint={2110.01948},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

Contacts:

To get the full dataset or additional information, please contact mohamedhadjameur@gmail.com, ahassina@cerist.dz, or drdhn@cerist.dz.
