The value of statistical tools to detect data fabrication

C.H.J. Hartgerink, J.M. Wicherts, M.A.L.M. van Assen

Research output: Contribution to journalArticleScientificpeer-review

Abstract

We aim to investigate how statistical tools can help detect potential data fabrication in the social- and medical sciences. In this proposal we outline three projects to assess the value of such statistical tools to detect potential data fabrication and make the first steps in order to apply them automatically to detect data anomalies, potentially due to data fabrication. In Project 1, we examine the performance of statistical methods to detect data fabrication in a mixture of genuine and fabricated data sets, where the fabricated data sets are generated by actual researchers who participate in our study. We also interview these researchers in order to investigate, in Project 2, different data fabrication characteristics and whether data generated with certain characteristics are better detected with current statistical tools than others. In Project 3 we use software to semi-automatically screen research articles to detect data anomalies that are potentially due to fabrication, and develop and test new software forming the basis for automated screening of research articles for data anomalies, potentially due to data fabrication, in the future.
Keywords: data fabrication; statistics; scientific misconduct; integrity
Original languageEnglish
Article numbere8860
JournalResearch Ideas and Outcomes
Volume2
DOIs
Publication statusPublished - 2016

Fingerprint

Fabrication
Statistical methods
Screening
Statistics

Cite this

@article{303d4da930a343e6889704bdba121fce,
title = "The value of statistical tools to detect data fabrication",
abstract = "We aim to investigate how statistical tools can help detect potential data fabrication in the social- and medical sciences. In this proposal we outline three projects to assess the value of such statistical tools to detect potential data fabrication and make the first steps in order to apply them automatically to detect data anomalies, potentially due to data fabrication. In Project 1, we examine the performance of statistical methods to detect data fabrication in a mixture of genuine and fabricated data sets, where the fabricated data sets are generated by actual researchers who participate in our study. We also interview these researchers in order to investigate, in Project 2, different data fabrication characteristics and whether data generated with certain characteristics are better detected with current statistical tools than others. In Project 3 we use software to semi-automatically screen research articles to detect data anomalies that are potentially due to fabrication, and develop and test new software forming the basis for automated screening of research articles for data anomalies, potentially due to data fabrication, in the future.Keywords: data fabrication; statistics; scientific misconduct; integrity",
author = "C.H.J. Hartgerink and J.M. Wicherts and {van Assen}, M.A.L.M.",
year = "2016",
doi = "10.3897/rio.2.e8860",
language = "English",
volume = "2",
journal = "Research Ideas and Outcomes",
issn = "2367-7163",

}

The value of statistical tools to detect data fabrication. / Hartgerink, C.H.J.; Wicherts, J.M.; van Assen, M.A.L.M.

In: Research Ideas and Outcomes, Vol. 2, e8860, 2016.

Research output: Contribution to journalArticleScientificpeer-review

TY - JOUR

T1 - The value of statistical tools to detect data fabrication

AU - Hartgerink, C.H.J.

AU - Wicherts, J.M.

AU - van Assen, M.A.L.M.

PY - 2016

Y1 - 2016

N2 - We aim to investigate how statistical tools can help detect potential data fabrication in the social- and medical sciences. In this proposal we outline three projects to assess the value of such statistical tools to detect potential data fabrication and make the first steps in order to apply them automatically to detect data anomalies, potentially due to data fabrication. In Project 1, we examine the performance of statistical methods to detect data fabrication in a mixture of genuine and fabricated data sets, where the fabricated data sets are generated by actual researchers who participate in our study. We also interview these researchers in order to investigate, in Project 2, different data fabrication characteristics and whether data generated with certain characteristics are better detected with current statistical tools than others. In Project 3 we use software to semi-automatically screen research articles to detect data anomalies that are potentially due to fabrication, and develop and test new software forming the basis for automated screening of research articles for data anomalies, potentially due to data fabrication, in the future.Keywords: data fabrication; statistics; scientific misconduct; integrity

AB - We aim to investigate how statistical tools can help detect potential data fabrication in the social- and medical sciences. In this proposal we outline three projects to assess the value of such statistical tools to detect potential data fabrication and make the first steps in order to apply them automatically to detect data anomalies, potentially due to data fabrication. In Project 1, we examine the performance of statistical methods to detect data fabrication in a mixture of genuine and fabricated data sets, where the fabricated data sets are generated by actual researchers who participate in our study. We also interview these researchers in order to investigate, in Project 2, different data fabrication characteristics and whether data generated with certain characteristics are better detected with current statistical tools than others. In Project 3 we use software to semi-automatically screen research articles to detect data anomalies that are potentially due to fabrication, and develop and test new software forming the basis for automated screening of research articles for data anomalies, potentially due to data fabrication, in the future.Keywords: data fabrication; statistics; scientific misconduct; integrity

U2 - 10.3897/rio.2.e8860

DO - 10.3897/rio.2.e8860

M3 - Article

VL - 2

JO - Research Ideas and Outcomes

JF - Research Ideas and Outcomes

SN - 2367-7163

M1 - e8860

ER -