Annu Int Conf IEEE Eng Med Biol Soc. 2021 Nov;2021:2472-2475. doi: 10.1109/EMBC46164.2021.9630317.
The increasing availability of electronic health records and administrative data and the adoption of computer-based technologies in healthcare have significantly focused on medical informatics. Sepsis is a time-critical condition with high mortality, yet it is often not identified in a timely fashion. The early detection and diagnosis of sepsis can increase the likelihood of survival and improve long-term outcomes for patients. In this paper, we use SHapley Additive exPlanations (SHAP) analysis to explore the variables most highly associated with developing sepsis in patients and evaluating different supervised learning models for classification. To develop our predictive models, we used the data collected after the first and the fifth hour of admission and evaluated the contribution of different features to the prediction results for both time intervals. The results of our study show that, while there is a high level of missing data during the early stages of admission, this data can be effectively utilized for the early prediction of sepsis. We also found a high level of inconsistency between the contributing features at different stages of admission, which should be considered when developing machine learning models.
电子健康记录和管理数据的日益普及,以及医疗保健中计算机技术的采用,都使得医学信息学成为关注的焦点。败血症是一种时间敏感且死亡率高的病症,但往往不能及时得到识别。早期发现和诊断败血症可以提高患者的生存率,并改善其长期预后。在本文中,我们使用 SHapley Additive exPlanations (SHAP) 分析来探索与患者发生败血症最相关的变量,并评估不同的监督学习模型进行分类。为了开发我们的预测模型,我们使用了入院后第一小时和第五小时收集的数据,并评估了不同特征对两个时间间隔的预测结果的贡献。我们的研究结果表明,虽然在入院早期存在大量缺失数据,但这些数据可以有效地用于败血症的早期预测。我们还发现,在不同入院阶段,特征的贡献存在高度不一致性,在开发机器学习模型时应考虑这一点。