An interpretable machine learning approach for detecting psoriatic arthritis in a UK primary care psoriasis cohort using electronic health records from the Clinical Practice Research Datalink

Research output: Contribution to journalArticlepeer-review

1 Citation (SciVal)

Abstract

Objectives: Develop an interpretable machine learning model to detect patients with newly diagnosed psoriatic arthritis (PsA) in a cohort of psoriasis patients and identify important clinical indicators of PsA in primary care.

Methods: We developed models using UK primary care electronic health records from the Clinical Practice Research Datalink (CPRD). The study population consisted of a cohort of (PsA free) patients with incident psoriasis who were followed prospectively. We used Bayesian networks (BNs) to identify patients who developed PsA using primary care variables measured prior to diagnosis and compared the results to a random forest (RF). Variables included patient demographics, musculoskeletal symptoms, blood tests, and prescriptions. The importance of each variable used in the models was evaluated using permutation variable importance. Model discrimination was measured using the area under the receiver operating characteristic curve (AUC) and the area under the precision-recall curve (PRAUC).

Results: We identified a cohort of 122,330 patients with an incident psoriasis diagnosis between 1998 and 2019 in the CPRD, of whom 2460 patients went on to develop PsA. Our best BN achieved an AUC of 0.823, and PRAUC of 0.221, compared to the AUC of 0.851 and PRAUC of 0.261 of the RF. Psoriasis duration, nonsteroidal anti-inflammatory drug prescriptions, nonspecific arthritis, nonspecific arthralgia, and C-reactive protein blood tests were all important variables in our models.

Conclusions: We were able to identify psoriasis patients at higher risk, and important indicators, of PsA in UK primary care. Further work is required to evaluate our model's usefulness in assisting PsA screening.
Original languageEnglish
Pages (from-to)575-583
Number of pages9
JournalAnnals of the Rheumatic Diseases
Volume84
Issue number4
Early online date1 Mar 2025
DOIs
Publication statusPublished - Apr 2025

Bibliographical note

Publisher Copyright:
© 2025 The Author(s)

Data Availability Statement

Data used for this study is not publicly available. Data from the Clinical Practice Research Datalink is under licence from the UK Medicines and Healthcare Products Regulatory Agency, which restricts the sharing of data to persons outside the study protocol.

Acknowledgements

The authors gratefully acknowledge the University of Bath's Research Computing Group (doi.org/10.15125/b6cd-s854) for their support in this work.

Funding

This Investigator Initiated Study was financially supported by UCB Biopharma SRL. The authors gratefully acknowledge the University of Bath's Research Computing Group (doi.org/10.15125/b6cd-s854) for their support in this work. This investigator-initiated study was financially supported by UCB Biopharma SRL. Not applicable. The use of Clinical Practice Research Datalink data for this study was approved by the Research Data Governance Process from the Medicines and Healthcare products Regulatory Agency (Protocol ID 21_000578). Not commissioned; externally peer reviewed. Data used for this study is not publicly available. Data from the Clinical Practice Research Datalink is under licence from the UK Medicines and Healthcare Products Regulatory Agency, which restricts the sharing of data to persons outside the study protocol. Patients and/or the public were not involved in the design, or conduct, or reporting, or dissemination plans of this research.

FundersFunder number
UCB
UK Medicines and Healthcare Products Regulatory Agency
Medicines and Healthcare products Regulatory Agency21_000578

    Fingerprint

    Dive into the research topics of 'An interpretable machine learning approach for detecting psoriatic arthritis in a UK primary care psoriasis cohort using electronic health records from the Clinical Practice Research Datalink'. Together they form a unique fingerprint.

    Cite this