Sign In    Register

World Journal of Emergency Medicine ›› 2024, Vol. 15 ›› Issue (5): 379-385.doi: 10.5847/wjem.j.1920-8642.2024.074

• Original Articles • Previous Articles     Next Articles

Prediction of sepsis within 24 hours at the triage stage in emergency departments using machine learning

Jingyuan Xie1, Jiandong Gao1,2, Mutian Yang3, Ting Zhang4, Yecheng Liu4,5, Yutong Chen1, Zetong Liu1, Qimin Mei4, Zhimao Li4, Huadong Zhu4(), Ji Wu1,2()   

  1. 1Department of Electronic Engineering, Tsinghua University, Beijing 100084, China
    2Center for Big Data and Clinical Research, Institute for Precision Medicine, Tsinghua University, Beijing 100084, China
    3Institute of Medical Technology, Peking University Health Science Center, Beijing 100091, China
    4Emergency Department, State Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College Hospital, Chinese Academy of Medical Science and Peking Union Medical College, Beijing 100005, China
    5Department of Health Care, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 100005, China
  • Received:2024-03-29 Online:2024-09-09 Published:2024-09-01
  • Contact: Ji Wu, Email: wuji_ee@tsinghua.edu.cn; Huadong Zhu, Email: drzhuhd@sina.com

Abstract:

BACKGROUND Sepsis is one of the main causes of mortality in intensive care units (ICUs). Early prediction is critical for reducing injury. As approximately 36% of sepsis occur within 24 h after emergency department (ED) admission in Medical Information Mart for Intensive Care (MIMIC-IV), a prediction system for the ED triage stage would be helpful. Previous methods such as the quick Sequential Organ Failure Assessment (qSOFA) are more suitable for screening than for prediction in the ED, and we aimed to find a light-weight, convenient prediction method through machine learning.

METHODS We accessed the MIMIC-IV for sepsis patient data in the EDs. Our dataset comprised demographic information, vital signs, and synthetic features. Extreme Gradient Boosting (XGBoost) was used to predict the risk of developing sepsis within 24 h after ED admission. Additionally, SHapley Additive exPlanations (SHAP) was employed to provide a comprehensive interpretation of the model's results. Ten percent of the patients were randomly selected as the testing set, while the remaining patients were used for training with 10-fold cross-validation.

RESULTS For 10-fold cross-validation on 14,957 samples, we reached an accuracy of 84.1%±0.3% and an area under the receiver operating characteristic (ROC) curve of 0.92±0.02. The model achieved similar performance on the testing set of 1,662 patients. SHAP values showed that the five most important features were acuity, arrival transportation, age, shock index, and respiratory rate.

CONCLUSION Machine learning models such as XGBoost may be used for sepsis prediction using only a small amount of data conveniently collected in the ED triage stage. This may help reduce workload in the ED and warn medical workers against the risk of sepsis in advance.

Key words: Sepsis, Machine learning, Emergency department, Triage, Informatics