Abstract
The sensitivity and the specificity of four outlier scores were studied for four different discordancy tests. The outlier scores were the Mahalanobis distance, a robust version of the Mahalanobis distance, and two measures tailored to discrete data, known as O+ and G+. The discordancy tests were Tukey’s fences (a.k.a. boxplot). Tukey’s fences with adjustment for skewness (adjusted boxplot), the generalized extreme studentized deviate (ESD), and the transformed ESD (ESD-T). Outlier scores O+ and G+ performed better than the Mahalanobis distance and its robust version. Discordancy tests ESD-T and adjusted boxplot were advocated for high specificity and ESD for high sensitivity.
Keywords: discordancy test, Mahalanobis distance, outlier detection in questionnaire data, outlier score O+, outlier score G+, robust Mahalanobis distance
Keywords: discordancy test, Mahalanobis distance, outlier detection in questionnaire data, outlier score O+, outlier score G+, robust Mahalanobis distance
Original language | English |
---|---|
Pages (from-to) | 69-77 |
Journal | Methodology: European Journal of Research Methods for the Behavioral and Social Sciences |
Volume | 9 |
Issue number | 2 |
DOIs | |
Publication status | Published - 2013 |