TY - JOUR
T1 - Using machine learning to assess the predictive potential of standardized nursing data for home healthcare case-mix classification
AU - de Korte, Maud H.
AU - Verhoeven, Gertjan S.
AU - Elissen, Arianne M. J.
AU - Metzelthin, Silke F.
AU - Ruwaard, Dirk
AU - Mikkers, Misja C.
PY - 2020
Y1 - 2020
N2 - Background The Netherlands is currently investigating the feasibility of moving from fee-for-service to prospective payments for home healthcare, which would require a suitable case-mix system. In 2017, health insurers mandated a preliminary case-mix system as a first step towards generating information on client differences in relation to care use. Home healthcare providers have also increasingly adopted standardized nursing terminology (SNT) as part of their electronic health records (EHRs), providing novel data for predictive modelling. Objective To explore the predictive potential of SNT data for improvement of the existing preliminary Dutch case-mix classification for home healthcare utilization. Methods We extracted client-level data from the EHRs of a large home healthcare provider, including data from the existing Dutch case-mix system, SNT data (specifically, NANDA-I) and the hours of home healthcare provided. We evaluated the predictive accuracy of the case-mix system and the SNT data separately, and combined, using the machine learning algorithm Random Forest. Results The case-mix system had a predictive performance of 22.4% cross-validatedR-squared and 6.2% cross-validated Cumming's Prediction Measure (CPM). Adding SNT data led to a substantial relative improvement in predicting home healthcare hours, yielding 32.1%R-squared and 15.4% CPM. Discussion The existing preliminary Dutch case-mix system distinguishes client needs to some degree, but not sufficiently. The results indicate that routinely collected SNT data contain sufficient additional predictive value to warrant further research for use in case-mix system design.
AB - Background The Netherlands is currently investigating the feasibility of moving from fee-for-service to prospective payments for home healthcare, which would require a suitable case-mix system. In 2017, health insurers mandated a preliminary case-mix system as a first step towards generating information on client differences in relation to care use. Home healthcare providers have also increasingly adopted standardized nursing terminology (SNT) as part of their electronic health records (EHRs), providing novel data for predictive modelling. Objective To explore the predictive potential of SNT data for improvement of the existing preliminary Dutch case-mix classification for home healthcare utilization. Methods We extracted client-level data from the EHRs of a large home healthcare provider, including data from the existing Dutch case-mix system, SNT data (specifically, NANDA-I) and the hours of home healthcare provided. We evaluated the predictive accuracy of the case-mix system and the SNT data separately, and combined, using the machine learning algorithm Random Forest. Results The case-mix system had a predictive performance of 22.4% cross-validatedR-squared and 6.2% cross-validated Cumming's Prediction Measure (CPM). Adding SNT data led to a substantial relative improvement in predicting home healthcare hours, yielding 32.1%R-squared and 15.4% CPM. Discussion The existing preliminary Dutch case-mix system distinguishes client needs to some degree, but not sufficiently. The results indicate that routinely collected SNT data contain sufficient additional predictive value to warrant further research for use in case-mix system design.
KW - Case-mix
KW - Home care
KW - Electronic health records
KW - Machine learning
KW - Predictive modelling
KW - RISK
KW - SYSTEMS
U2 - 10.1007/s10198-020-01213-9
DO - 10.1007/s10198-020-01213-9
M3 - Article
SN - 1618-7598
VL - 21
SP - 1121
EP - 1129
JO - European Journal of Health Economics
JF - European Journal of Health Economics
ER -