Abstract
A Multivariate Mixed method for Statistical Matching (MMSM) is proposed. The MMSM is a predictive mean matching method for imputing values when integrating two datasets that come from the same population, share no units, and measure several common and non-common variables. It accounts for the multivariate structure of the data by using multivariate Bayesian regression. The MMSM can also include auxiliary information from an additional dataset to improve the computation of intermediate values, and constraints to improve the selection of donors. The results of a simulation study show that including information from an auxiliary dataset leads to far better results, especially in terms of bias and percentage of correct imputations. Including constraints also increases the quality of the imputations, and hence of the statistical matching.
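To make the matching step concrete, the following is a minimal sketch of generic predictive mean matching between a donor file and a recipient file. It is not the MMSM itself: it assumes ordinary least squares in place of the multivariate Bayesian regression, uses plain Euclidean distance on the intermediate values, and omits the auxiliary dataset and the constraints. The function name `pmm_impute` and the simulated data are hypothetical and serve only as illustration.

```python
import numpy as np

def pmm_impute(x_recipient, x_donor, z_donor):
    """Impute z for recipients by predictive mean matching (OLS stand-in)."""
    x_recipient = np.asarray(x_recipient, dtype=float)
    x_donor = np.asarray(x_donor, dtype=float)
    z_donor = np.asarray(z_donor, dtype=float)
    if z_donor.ndim == 1:
        z_donor = z_donor[:, None]  # treat a single target as one column

    # Design matrices with an intercept column.
    xd = np.column_stack([np.ones(len(x_donor)), x_donor])
    xr = np.column_stack([np.ones(len(x_recipient)), x_recipient])

    # Fit z ~ x on the donor file (assumption: OLS, not Bayesian regression).
    beta, *_ = np.linalg.lstsq(xd, z_donor, rcond=None)

    # Intermediate (predicted) values for donors and recipients.
    pred_d = xd @ beta
    pred_r = xr @ beta

    # Squared Euclidean distance between each recipient and donor prediction.
    dist = ((pred_r[:, None, :] - pred_d[None, :, :]) ** 2).sum(axis=2)
    donor_idx = dist.argmin(axis=1)

    # Impute the donors' observed values, not the predictions.
    return z_donor[donor_idx], donor_idx

# Simulated example: file B observes (x, z); file A observes only x.
rng = np.random.default_rng(0)
x_b = rng.normal(size=(200, 2))
coef = np.array([[1.0, 0.5], [-0.3, 2.0]])
z_b = x_b @ coef + rng.normal(scale=0.1, size=(200, 2))
x_a = rng.normal(size=(50, 2))

z_imputed, donors = pmm_impute(x_a, x_b, z_b)
```

Because the imputed values are copied from observed donor records, every imputed row is a value combination that actually occurs in the donor file, which is the usual motivation for matching on predicted means rather than imputing the predictions directly.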
Original language | English |
---|---|
Article number | 107569 |
Number of pages | 14 |
Journal | Computational Statistics & Data Analysis |
Volume | 177 |
Publication status | Published - 2023 |
Keywords
- Adjusted weights
- Auxiliary dataset
- File concatenation
- Hard constraints
- Imputation
- Multiple imputation
- Predictive mean matching
- Soft constraints