CrowdTruth / Cross-Task-Majority-Vote-Eval

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

CrowdTruth Evaluation Dataset

Corpus of crowdsourced annotations together with trusted judgments for 4 crowdsourcing tasks:

  1. medical relation extraction (files medical_*)
  2. Twitter event extraction (files tweets_*)
  3. news event identification (files events_*)
  4. sound interpretation (files sounds_*)

The dataset was used to evaluate the CrowdTruth crowdsourcing aggregation metrics. Details are available in the paper:

For each of the 4 tasks, 2 files are given:

  • *_raw.csv - Contains the judgments of individual workers for each of the tasks.

  • *_aggregated.csv - Contains the CrowdTruth aggregation of the judgments for each unit in the task expressed as the media unit - annotation score, as well as the trusted judgment.

About