Gautamshahi / FakeCovid

FakeCovid- A Multilingual Cross-domain Fact Check News Dataset for COVID-19

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

FakeCovid- A Multilingual Cross-domain Fact Check News Dataset for COVID-19

Dataset for FakeCovid

This github repository corresponds dataset used for our research article titled FakeCovid- A Multilingual Cross-domain Fact Check News Dataset for COVID-19.

FakeCovid is the first multilingual cross-domain dataset of 7623 fact-checked news articles for COVID-19, collected from 04/01/2020 to 01/07/2020. We have collected the fact-checked articles from 92 fact-checking websites after obtaining references from Poynter and Snopes. We have manually annotated the collected articles into 11 categories of the fact-checked news according to their content. The ultimately generated dataset is in 40 languages from 105 countries.

The work has been accepted in the Workshop on Cyber Social Threats (CySoc 2020) at 14th International Conference on Web and Social Media 2020.

How do I cite this work?

For now, cite ICWSM Workshop paper:

@article{shahifakecovid,
  title={FakeCovid-A Multilingual Cross-domain Fact Check News Dataset for COVID-19},
  author={Shahi, Gautam Kishore and Nandini, Durgesh}
}

Contact information

For help or issues using data, please submit a GitHub issue.

For personal communication related to our work, please contact Gautam Kishore Shahi(gautamshahi16@gmail.com) and Durgesh Nandini(durgeshnandini16@yahoo.in).

More udpdate

For more update on the related publication on the topic of FakeCovid, please visit https://gautamshahi.github.io/FakeCovid/

About

FakeCovid- A Multilingual Cross-domain Fact Check News Dataset for COVID-19

License:Creative Commons Zero v1.0 Universal


Languages

Language:Jupyter Notebook 99.6%Language:HTML 0.2%Language:CSS 0.2%