Abstract
Early detection of sepsis is key to ensure timely clinical intervention. Since very few end-to-end pipelines are publicly available, fair comparisons between methodologies are difficult if not impossible. Progress is further limited by discrepancies in the reconstruction of sepsis onset time. This retrospective cohort study highlights the variation in performance of predictive models under three subtly different interpretations of sepsis onset from the sepsis-III definition and compares this against inter-model differences. The models are chosen to cover tree-based, deep learning, and survival analysis methods. Using the MIMIC-III database, between 867 and 2178 intensive care unit admissions with sepsis were identified, depending on the onset definition. We show that model performance can be more sensitive to differences in the definition of sepsis onset than to the model itself. Given a fixed sepsis definition, the best performing method had a gain of 1–5% in the area under the receiver operating characteristic (AUROC). However, the choice of onset time can cause a greater effect, with variation of 0–6% in AUROC. We illustrate that misleading conclusions can be drawn if models are compared without consideration of the sepsis definition used which emphasizes the need for a standardized definition for sepsis onset.
Original language | English |
---|---|
Article number | 1920 |
Number of pages | 10 |
Journal | Scientific Reports |
Volume | 14 |
Issue number | 1 |
DOIs | |
Publication status | Published - 22 Jan 2024 |
Bibliographical note
Data availability: The data we used in this paper is extracted from the MIMIC-III database. Once the required training and credentials are obtained, this dataset is accessible from PhysioNet at https://physionet.org/content/mimiciii/1.4/. The MIMIC-III project was approved by the Institutional Review Boards of Beth Israel Deaconess Medical Center (Boston, MA) and the Massachusetts Institute of Technology (Cambridge, MA). Requirement for individual patient consent was waived because the project did not impact clinical care and all protected health information was deidentified.Funding
S.C., J.F., P.F., H.L., T.L., S.M., J.M., H.N., E.P., Y.W., and L.Y. are supported by the Alan Turing Institute under the EPSRC grant EP/N510129/1. J.F., T.L., S.M., H.N., and Y.W. are supported by the EPSRC under the program grant EP/S026347/1. T. L. is supported in part by the Data Centric Engineering Programme (under the Lloyd’s Register Foundation grant G0095) and the Office of National Statistics Programme (funded by the UK Government) and in part by the Hong Kong Innovation and Technology Commission (InnoHK Project CIMDA). T.L. and P.F. are supported by the Defence and Security Programme at the Alan Turing Institute, funded by the UK Government. H.L. is supported by UCL-CSC scholarship by University College London and the China Scholarship Council (CSC) from the Ministry of Education of P.R. China. L.Y. and J.M. are supported by EPSRC grant EP/L015803/1 and L.Y. is also supported by the Clarendon Fund. E.P. is supported by an NIHR clinical lectureship. B. W. is supported by “The Harvard Program in Precision Psychiatry” under the funding of Harvard Medical School and the Sang Foundation.
Funders | Funder number |
---|---|
Data Centric Engineering Programme | |
InnoHK | |
Office of National Statistics Programme | |
Sang Foundation | |
UCL-CSC | |
Harvard University | |
Lloyd's Register Foundation | G0095 |
Department for Business, Energy & Industrial Strategy | |
The Alan Turing Institute | |
Department of Transport, UK Government | |
Department for Enterprise, Trade and Investment, UK Government | Security Programme at the Alan Turing Institute, Defence |
Department of Agriculture, Environment and Rural Affairs, UK Government | |
University of Oxford | |
Engineering and Physical Sciences Research Council | EP/N510129/1, EP/S026347/1 |
National Institute for Health and Care Research | |
Department for Environment, Food and Rural Affairs, UK Government | the Office of National Statistics Programme |
University of Oxford | |
Department of Health, Social Services and Public Safety, UK Government | |
University College London | |
University of Oxford | |
Ministry of Education China | EP/L015803/1 |
China Scholarship Council | |
Hong Kong Special Administrative Region of the People's Republic of China | |
Clarendon Fund | |
Ministry of Agriculture, Fisheries and Food, UK Government |
Keywords
- Sepsis
- Sepsis-III
- MIMIC-III
- Sepsis labelling
- Sepsis onset
- Machine learning
ASJC Scopus subject areas
- General