Abstract

BACKGROUND: Machine learning (ML) models in healthcare are crucial for predicting clinical outcomes, and their effectiveness can be significantly enhanced through improvements in accuracy, generalisability, and interpretability. To achieve widespread adoption in clinical practice, risk factors identified by these models must be validated in diverse populations.

METHODS: In this cohort study, 86 870 community-dwelling older adults ≥65 years from the UK Biobank database were used to train and test three ML models to predict 30-day emergency hospitalisation. The three ML models, Random Forest (RF), XGBoost (XGB), and Logistic Regression (LR), utilised all extracted variables, consisting of demographic and geriatric syndromes, comorbidities, and the Drug Burden Index (DBI), a measure of potentially inappropriate polypharmacy, which quantifies exposure to medications with anticholinergic and sedative properties. 30-day emergency hospitalisation was defined as any hospitalisation related to any clinical event within 30 days of the index date. The model performance metrics included the area under the receiver operating characteristics curve (AUC-ROC) and the F1 score.

RESULTS: The AUC-ROC for the RF, XGB and LR models was 0.78, 0.86 and 0.61, respectively, signifying good discriminatory power. The DBI, mobility, fractures, falls, hazardous alcohol drinking and smoking were validated as important variables in predicting 30-day emergency hospitalisation.

CONCLUSIONS: This study validated important risk factors for predicting 30-day emergency hospitalisation. The validation of important risk factors will inform the development of future ML studies in geriatrics. Future research should prioritise the development of targeted interventions to address the risk factors validated in this study, ultimately improving patient outcomes and alleviating healthcare burdens.

Original languageEnglish
Article numberafaf156
JournalAge and Ageing
Volume54
Issue number6
Early online date6 Jun 2025
DOIs
Publication statusPublished - 30 Jun 2025

Data Availability Statement

More details concerning the UK Biobank and how to access the datasets can be found at www.ukbiobank.ac.uk/. The analytical code has been deposited in a GitHub repository: github.com/RobertOlender/ML_UKBiobank_emergency_30-day_hospitalisation to enable reproducibility.

Funding

This work was supported by the University Research Studentship Award at the University of Bath (project code EA-PA1231).

FundersFunder number
University of BathEA-PA1231

    Keywords

    • artificial intelligence
    • decision tree
    • hospitalisation
    • machine learning
    • older people
    • predictive modelling

    ASJC Scopus subject areas

    • Ageing
    • Geriatrics and Gerontology

    Fingerprint

    Dive into the research topics of 'Potentially inappropriate polypharmacy is an important predictor of 30-day emergency hospitalisation in older adults: a machine learning feature validation study'. Together they form a unique fingerprint.

    Cite this