The sense and non-sense of holdout sample validation in the presence of endogeneity

P. Ebbes, D. Papies, H.J. van Heerde

Research output: Contribution to journal › Article › Scientific › peer-review

49 Citations (Scopus)

Abstract

Market response models based on field-generated data need to address potential endogeneity in the regressors to obtain consistent parameter estimates. Another requirement is that market response models predict well in a holdout sample. With both requirements combined, it may seem reasonable to subject an endogeneity-corrected model to a holdout prediction task, and this is quite common in the academic marketing literature. One may be inclined to expect that the consistent parameter estimates obtained via instrumental variables (IV) estimation predict better than the biased ordinary least squares (OLS) estimates. This paper shows that this expectation is incorrect. That is, if the holdout sample is similar to the estimation sample so that the regressors are endogenous in both samples, holdout sample validation favors regression estimates that are not corrected for endogeneity (i.e., OLS) over estimates that are corrected for endogeneity (i.e., IV estimation). We also discuss ways in which holdout samples may be used sensibly in the presence of endogeneity. A key takeaway is that if consistent parameter estimates are the primary model objective, the model should be validated with an exogenous (rather than endogenous) holdout sample.
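
The abstract's central claim can be illustrated with a small simulation. The sketch below is not taken from the paper: the data-generating process, the parameter values (beta, gamma, rho), the sample size, and all function names are assumptions chosen for illustration. It fits a no-intercept OLS slope and a single-instrument IV slope on an estimation sample, then compares their mean squared prediction error on an endogenous holdout (same process as the estimation sample) and on an exogenous holdout (the regressor is independent of the error). Under these assumptions, OLS should predict better on the endogenous holdout and IV on the exogenous one.

```python
import random


def simulate(n, rng, beta=1.0, gamma=1.0, rho=0.8, endogenous=True):
    """Draw n rows (z, x, y) from y = beta*x + e with x = gamma*z + v.

    Hypothetical data-generating process, for illustration only.
    When endogenous is True, corr(e, v) = rho, so x is correlated with e.
    When False, v is drawn independently of e, so x is exogenous.
    """
    rows = []
    for _ in range(n):
        z = rng.gauss(0.0, 1.0)  # instrument: moves x, unrelated to e
        e = rng.gauss(0.0, 1.0)  # structural error
        source = e if endogenous else rng.gauss(0.0, 1.0)
        v = rho * source + (1.0 - rho ** 2) ** 0.5 * rng.gauss(0.0, 1.0)
        x = gamma * z + v
        rows.append((z, x, beta * x + e))
    return rows


def slope_ols(rows):
    # No-intercept OLS slope; all variables have mean zero by construction.
    return sum(x * y for _, x, y in rows) / sum(x * x for _, x, _ in rows)


def slope_iv(rows):
    # Simple IV estimator with one instrument: sum(z*y) / sum(z*x).
    return sum(z * y for z, _, y in rows) / sum(z * x for z, x, _ in rows)


def mse(rows, b):
    # Mean squared prediction error of y_hat = b * x.
    return sum((y - b * x) ** 2 for _, x, y in rows) / len(rows)


rng = random.Random(0)
estimation = simulate(5000, rng)
b_ols = slope_ols(estimation)  # inconsistent for beta, biased upward here
b_iv = slope_iv(estimation)    # consistent for beta

holdout_endo = simulate(5000, rng, endogenous=True)
holdout_exo = simulate(5000, rng, endogenous=False)

mse_endo = {"ols": mse(holdout_endo, b_ols), "iv": mse(holdout_endo, b_iv)}
mse_exo = {"ols": mse(holdout_exo, b_ols), "iv": mse(holdout_exo, b_iv)}

print("slopes:", b_ols, b_iv)
print("endogenous holdout MSE:", mse_endo)
print("exogenous holdout MSE:", mse_exo)
```

The intuition this sketch relies on is that the OLS slope is the best linear predictor of y given x in the population that generated the estimation sample, so a holdout drawn from that same population rewards OLS even though it is biased for beta. Only when the holdout regressor is unrelated to the error does prediction accuracy line up with the quality of the causal estimate.
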
Original language: English
Pages (from-to): 1115-1122
Journal: Marketing Science
Volume: 30
Issue number: 6
Publication status: Published - 2011
