Reputation as a sufficient condition for data quality on Amazon Mechanical Turk

E. Peer, J. Vosgerau, A. Acquisti

Research output: Contribution to journalArticleScientificpeer-review

Abstract

Data quality is one of the major concerns of using crowdsourcing websites such as Amazon Mechanical Turk (MTurk) to recruit participants for online behavioral studies. We compared two methods for ensuring data quality on MTurk: attention check questions (ACQs) and restricting participation to MTurk workers with high reputation (above 95% approval ratings). In Experiment 1, we found that high-reputation workers rarely failed ACQs and provided higher-quality data than did low-reputation workers; ACQs improved data quality only for low-reputation workers, and only in some cases. Experiment 2 corroborated these findings and also showed that more productive high-reputation workers produce the highest-quality data. We concluded that sampling high-reputation workers can ensure high-quality data without having to resort to using ACQs, which may lead to selection bias if participants who fail ACQs are excluded post-hoc.
Original languageEnglish
Pages (from-to)1023-1031
JournalBehavior Research Methods
Volume46
Issue number4
Early online date20 Dec 2013
DOIs
Publication statusPublished - Dec 2014

Fingerprint

Selection Bias
Turks
Workers
Amazon
Data Accuracy
Experiment
Participation
Rating
Sampling
Web Sites

Keywords

  • Online research
  • Amazon Mechanical Turk
  • data quality
  • reputation

Cite this

Peer, E. ; Vosgerau, J. ; Acquisti, A. / Reputation as a sufficient condition for data quality on Amazon Mechanical Turk. In: Behavior Research Methods. 2014 ; Vol. 46, No. 4. pp. 1023-1031.
@article{e6b5ea48e0df46c0ab9b84ab0743160b,
title = "Reputation as a sufficient condition for data quality on Amazon Mechanical Turk",
abstract = "Data quality is one of the major concerns of using crowdsourcing websites such as Amazon Mechanical Turk (MTurk) to recruit participants for online behavioral studies. We compared two methods for ensuring data quality on MTurk: attention check questions (ACQs) and restricting participation to MTurk workers with high reputation (above 95{\%} approval ratings). In Experiment 1, we found that high-reputation workers rarely failed ACQs and provided higher-quality data than did low-reputation workers; ACQs improved data quality only for low-reputation workers, and only in some cases. Experiment 2 corroborated these findings and also showed that more productive high-reputation workers produce the highest-quality data. We concluded that sampling high-reputation workers can ensure high-quality data without having to resort to using ACQs, which may lead to selection bias if participants who fail ACQs are excluded post-hoc.",
keywords = "Online research, Amazon Mechanical Turk, data quality, reputation",
author = "E. Peer and J. Vosgerau and A. Acquisti",
year = "2014",
month = "12",
doi = "10.3758/s13428-013-0434-y",
language = "English",
volume = "46",
pages = "1023--1031",
journal = "Behavior Research Methods",
issn = "1554-351X",
publisher = "Springer",
number = "4",

}

Reputation as a sufficient condition for data quality on Amazon Mechanical Turk. / Peer, E.; Vosgerau, J.; Acquisti, A.

In: Behavior Research Methods, Vol. 46, No. 4, 12.2014, p. 1023-1031.

Research output: Contribution to journalArticleScientificpeer-review

TY - JOUR

T1 - Reputation as a sufficient condition for data quality on Amazon Mechanical Turk

AU - Peer, E.

AU - Vosgerau, J.

AU - Acquisti, A.

PY - 2014/12

Y1 - 2014/12

N2 - Data quality is one of the major concerns of using crowdsourcing websites such as Amazon Mechanical Turk (MTurk) to recruit participants for online behavioral studies. We compared two methods for ensuring data quality on MTurk: attention check questions (ACQs) and restricting participation to MTurk workers with high reputation (above 95% approval ratings). In Experiment 1, we found that high-reputation workers rarely failed ACQs and provided higher-quality data than did low-reputation workers; ACQs improved data quality only for low-reputation workers, and only in some cases. Experiment 2 corroborated these findings and also showed that more productive high-reputation workers produce the highest-quality data. We concluded that sampling high-reputation workers can ensure high-quality data without having to resort to using ACQs, which may lead to selection bias if participants who fail ACQs are excluded post-hoc.

AB - Data quality is one of the major concerns of using crowdsourcing websites such as Amazon Mechanical Turk (MTurk) to recruit participants for online behavioral studies. We compared two methods for ensuring data quality on MTurk: attention check questions (ACQs) and restricting participation to MTurk workers with high reputation (above 95% approval ratings). In Experiment 1, we found that high-reputation workers rarely failed ACQs and provided higher-quality data than did low-reputation workers; ACQs improved data quality only for low-reputation workers, and only in some cases. Experiment 2 corroborated these findings and also showed that more productive high-reputation workers produce the highest-quality data. We concluded that sampling high-reputation workers can ensure high-quality data without having to resort to using ACQs, which may lead to selection bias if participants who fail ACQs are excluded post-hoc.

KW - Online research

KW - Amazon Mechanical Turk

KW - data quality

KW - reputation

U2 - 10.3758/s13428-013-0434-y

DO - 10.3758/s13428-013-0434-y

M3 - Article

VL - 46

SP - 1023

EP - 1031

JO - Behavior Research Methods

JF - Behavior Research Methods

SN - 1554-351X

IS - 4

ER -