Detection of primary Sjögren’s syndrome in primary care: Developing a classification model with the use of routine healthcare data and machine learning

Jesper T. Dros*, Isabelle Bos, Frank C. Bennis, Sytske Wiegersma, John Paget, Chiara Seghieri, Jaime Barrio Cortés, Robert A. Verheij

*Corresponding author for this work

Research output: Contribution to journalArticleScientificpeer-review

51 Downloads (Pure)

Abstract

Background
Primary Sjögren’s Syndrome (pSS) is a rare autoimmune disease that is difficult to diagnose due to a variety of clinical presentations, resulting in misdiagnosis and late referral to specialists. To improve early-stage disease recognition, this study aimed to develop an algorithm to identify possible pSS patients in primary care. We built a machine learning algorithm which was based on combined healthcare data as a first step towards a clinical decision support system.

Method
Routine healthcare data, consisting of primary care electronic health records (EHRs) data and hospital claims data (HCD), were linked on patient level and consisted of 1411 pSS and 929,179 non-pSS patients. Logistic regression (LR) and random forest (RF) models were used to classify patients using age, gender, diseases and symptoms, prescriptions and GP visits.

Results
The LR and RF models had an AUC of 0.82 and 0.84, respectively. Many actual pSS patients were found (sensitivity LR = 72.3%, RF = 70.1%), specificity was 74.0% (LR) and 77.9% (RF) and the negative predictive value was 99.9% for both models. However, most patients classified as pSS patients did not have a diagnosis of pSS in secondary care (positive predictive value LR = 0.4%, RF = 0.5%).

Conclusion
This is the first study to use machine learning to classify patients with pSS in primary care using GP EHR data. Our algorithm has the potential to support the early recognition of pSS in primary care and should be validated and optimized in clinical practice. To further enhance the algorithm in detecting pSS in primary care, we suggest it is improved by working with experienced clinicians.
Original languageEnglish
Article number199
JournalBMC Primary Care
Volume23
DOIs
Publication statusPublished - 2022

Keywords

  • primary sjogren's syndrome
  • Machine Learning
  • routine healthcare data
  • Primary care

Fingerprint

Dive into the research topics of 'Detection of primary Sjögren’s syndrome in primary care: Developing a classification model with the use of routine healthcare data and machine learning'. Together they form a unique fingerprint.

Cite this