Principled missing data treatments

K.M. Lang, Todd D. Little

Research output: Contribution to journalArticleScientificpeer-review

Abstract

We review a number of issues regarding missing data treatments for intervention and prevention researchers. Many of the common missing data practices in prevention research are still, unfortunately, ill-advised (e.g., use of listwise and pairwise deletion, insufficient use of auxiliary variables). Our goal is to promote better practice in the handling of missing data. We review the current state of missing data methodology and recent missing data reporting in prevention research. We describe antiquated, ad hoc missing data treatments and discuss their limitations. We discuss two modern, principled missing data treatments: multiple imputation and full information maximum likelihood, and we offer practical tips on how to best employ these methods in prevention research. The principled missing data treatments that we discuss are couched in terms of how they improve causal and statistical inference in the prevention sciences. Our recommendations are firmly grounded in missing data theory and well-validated statistical principles for handling the missing data issues that are ubiquitous in biosocial and prevention research. We augment our broad survey of missing data analysis with references to more exhaustive resources.

Original languageEnglish
Pages (from-to)284-294
JournalPrevention Science
Volume19
Issue number3
DOIs
Publication statusPublished - 2018

Keywords

  • Missing data
  • Multiple imputation
  • Full information maximum likelihood
  • Auxiliary variables
  • Intent-to-treat
  • Statistical inference
  • INFORMATION MAXIMUM-LIKELIHOOD
  • MULTIPLE IMPUTATION
  • MULTIVARIATE IMPUTATION
  • REPORTING PRACTICES
  • SAMPLE SELECTION
  • DROP-OUT
  • MODELS
  • VARIABLES
  • SPECIFICATION
  • NONRESPONSE

Cite this

Lang, K.M. ; Little, Todd D. / Principled missing data treatments. In: Prevention Science. 2018 ; Vol. 19, No. 3. pp. 284-294.
@article{97ec3a78e08a4b0ba5c9f2352cb51af3,
title = "Principled missing data treatments",
abstract = "We review a number of issues regarding missing data treatments for intervention and prevention researchers. Many of the common missing data practices in prevention research are still, unfortunately, ill-advised (e.g., use of listwise and pairwise deletion, insufficient use of auxiliary variables). Our goal is to promote better practice in the handling of missing data. We review the current state of missing data methodology and recent missing data reporting in prevention research. We describe antiquated, ad hoc missing data treatments and discuss their limitations. We discuss two modern, principled missing data treatments: multiple imputation and full information maximum likelihood, and we offer practical tips on how to best employ these methods in prevention research. The principled missing data treatments that we discuss are couched in terms of how they improve causal and statistical inference in the prevention sciences. Our recommendations are firmly grounded in missing data theory and well-validated statistical principles for handling the missing data issues that are ubiquitous in biosocial and prevention research. We augment our broad survey of missing data analysis with references to more exhaustive resources.",
keywords = "Missing data, Multiple imputation, Full information maximum likelihood, Auxiliary variables, Intent-to-treat, Statistical inference, INFORMATION MAXIMUM-LIKELIHOOD, MULTIPLE IMPUTATION, MULTIVARIATE IMPUTATION, REPORTING PRACTICES, SAMPLE SELECTION, DROP-OUT, MODELS, VARIABLES, SPECIFICATION, NONRESPONSE",
author = "K.M. Lang and Little, {Todd D.}",
year = "2018",
doi = "10.1007/s11121-016-0644-5",
language = "English",
volume = "19",
pages = "284--294",
journal = "Prevention Science",
issn = "1389-4986",
publisher = "Springer Verlag",
number = "3",

}

Principled missing data treatments. / Lang, K.M.; Little, Todd D.

In: Prevention Science, Vol. 19, No. 3, 2018, p. 284-294.

Research output: Contribution to journalArticleScientificpeer-review

TY - JOUR

T1 - Principled missing data treatments

AU - Lang, K.M.

AU - Little, Todd D.

PY - 2018

Y1 - 2018

N2 - We review a number of issues regarding missing data treatments for intervention and prevention researchers. Many of the common missing data practices in prevention research are still, unfortunately, ill-advised (e.g., use of listwise and pairwise deletion, insufficient use of auxiliary variables). Our goal is to promote better practice in the handling of missing data. We review the current state of missing data methodology and recent missing data reporting in prevention research. We describe antiquated, ad hoc missing data treatments and discuss their limitations. We discuss two modern, principled missing data treatments: multiple imputation and full information maximum likelihood, and we offer practical tips on how to best employ these methods in prevention research. The principled missing data treatments that we discuss are couched in terms of how they improve causal and statistical inference in the prevention sciences. Our recommendations are firmly grounded in missing data theory and well-validated statistical principles for handling the missing data issues that are ubiquitous in biosocial and prevention research. We augment our broad survey of missing data analysis with references to more exhaustive resources.

AB - We review a number of issues regarding missing data treatments for intervention and prevention researchers. Many of the common missing data practices in prevention research are still, unfortunately, ill-advised (e.g., use of listwise and pairwise deletion, insufficient use of auxiliary variables). Our goal is to promote better practice in the handling of missing data. We review the current state of missing data methodology and recent missing data reporting in prevention research. We describe antiquated, ad hoc missing data treatments and discuss their limitations. We discuss two modern, principled missing data treatments: multiple imputation and full information maximum likelihood, and we offer practical tips on how to best employ these methods in prevention research. The principled missing data treatments that we discuss are couched in terms of how they improve causal and statistical inference in the prevention sciences. Our recommendations are firmly grounded in missing data theory and well-validated statistical principles for handling the missing data issues that are ubiquitous in biosocial and prevention research. We augment our broad survey of missing data analysis with references to more exhaustive resources.

KW - Missing data

KW - Multiple imputation

KW - Full information maximum likelihood

KW - Auxiliary variables

KW - Intent-to-treat

KW - Statistical inference

KW - INFORMATION MAXIMUM-LIKELIHOOD

KW - MULTIPLE IMPUTATION

KW - MULTIVARIATE IMPUTATION

KW - REPORTING PRACTICES

KW - SAMPLE SELECTION

KW - DROP-OUT

KW - MODELS

KW - VARIABLES

KW - SPECIFICATION

KW - NONRESPONSE

U2 - 10.1007/s11121-016-0644-5

DO - 10.1007/s11121-016-0644-5

M3 - Article

VL - 19

SP - 284

EP - 294

JO - Prevention Science

JF - Prevention Science

SN - 1389-4986

IS - 3

ER -