• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

预测重症监护病房死亡率的疾病严重程度评分与人工智能模型的比较:荟萃分析与文献综述

Comparison of Severity of Illness Scores and Artificial Intelligence Models That Are Predictive of Intensive Care Unit Mortality: Meta-analysis and Review of the Literature.

作者信息

Barboi Cristina, Tzavelis Andreas, Muhammad Lutfiyya NaQiyba

机构信息

Indiana University Purdue University, Regenstrief Institue, Indianapolis, IN, United States.

Medical Scientist Training Program, Feinberg School of Medicine, Chicago, IL, United States.

出版信息

JMIR Med Inform. 2022 May 31;10(5):e35293. doi: 10.2196/35293.

DOI:10.2196/35293
PMID:35639445
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9198821/
Abstract

BACKGROUND

Severity of illness scores-Acute Physiology and Chronic Health Evaluation, Simplified Acute Physiology Score, and Sequential Organ Failure Assessment-are current risk stratification and mortality prediction tools used in intensive care units (ICUs) worldwide. Developers of artificial intelligence or machine learning (ML) models predictive of ICU mortality use the severity of illness scores as a reference point when reporting the performance of these computational constructs.

OBJECTIVE

This study aimed to perform a literature review and meta-analysis of articles that compared binary classification ML models with the severity of illness scores that predict ICU mortality and determine which models have superior performance. This review intends to provide actionable guidance to clinicians on the performance and validity of ML models in supporting clinical decision-making compared with the severity of illness score models.

METHODS

Between December 15 and 18, 2020, we conducted a systematic search of PubMed, Scopus, Embase, and IEEE databases and reviewed studies published between 2000 and 2020 that compared the performance of binary ML models predictive of ICU mortality with the performance of severity of illness score models on the same data sets. We assessed the studies' characteristics, synthesized the results, meta-analyzed the discriminative performance of the ML and severity of illness score models, and performed tests of heterogeneity within and among studies.

RESULTS

We screened 461 abstracts, of which we assessed the full text of 66 (14.3%) articles. We included in the review 20 (4.3%) studies that developed 47 ML models based on 7 types of algorithms and compared them with 3 types of the severity of illness score models. Of the 20 studies, 4 (20%) were found to have a low risk of bias and applicability in model development, 7 (35%) performed external validation, 9 (45%) reported on calibration, 12 (60%) reported on classification measures, and 4 (20%) addressed explainability. The discriminative performance of the ML-based models, which was reported as AUROC, ranged between 0.728 and 0.99 and between 0.58 and 0.86 for the severity of illness score-based models. We noted substantial heterogeneity among the reported models and considerable variation among the AUROC estimates for both ML and severity of illness score model types.

CONCLUSIONS

ML-based models can accurately predict ICU mortality as an alternative to traditional scoring models. Although the range of performance of the ML models is superior to that of the severity of illness score models, the results cannot be generalized due to the high degree of heterogeneity. When presented with the option of choosing between severity of illness score or ML models for decision support, clinicians should select models that have been externally validated, tested in the practice environment, and updated to the patient population and practice environment.

TRIAL REGISTRATION

PROSPERO CRD42021203871; https://tinyurl.com/28v2nch8.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c319/9198821/a792ad83b618/medinform_v10i5e35293_fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c319/9198821/877530dcd988/medinform_v10i5e35293_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c319/9198821/a6b8591b12ac/medinform_v10i5e35293_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c319/9198821/c7a09fc00431/medinform_v10i5e35293_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c319/9198821/531f90b6abb5/medinform_v10i5e35293_fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c319/9198821/22ffa089217c/medinform_v10i5e35293_fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c319/9198821/6b8c4c707e18/medinform_v10i5e35293_fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c319/9198821/a792ad83b618/medinform_v10i5e35293_fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c319/9198821/877530dcd988/medinform_v10i5e35293_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c319/9198821/a6b8591b12ac/medinform_v10i5e35293_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c319/9198821/c7a09fc00431/medinform_v10i5e35293_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c319/9198821/531f90b6abb5/medinform_v10i5e35293_fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c319/9198821/22ffa089217c/medinform_v10i5e35293_fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c319/9198821/6b8c4c707e18/medinform_v10i5e35293_fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c319/9198821/a792ad83b618/medinform_v10i5e35293_fig7.jpg
摘要

背景

疾病严重程度评分——急性生理与慢性健康状况评估、简化急性生理学评分和序贯器官衰竭评估——是目前全球重症监护病房(ICU)中使用的风险分层和死亡率预测工具。预测ICU死亡率的人工智能或机器学习(ML)模型的开发者在报告这些计算结构的性能时,将疾病严重程度评分作为参考点。

目的

本研究旨在对比较二元分类ML模型与预测ICU死亡率的疾病严重程度评分的文章进行文献综述和荟萃分析,并确定哪些模型具有更优的性能。本综述旨在为临床医生提供关于ML模型在支持临床决策方面的性能和有效性的可操作指导,以便与疾病严重程度评分模型进行比较。

方法

在2020年12月15日至18日期间,我们对PubMed、Scopus、Embase和IEEE数据库进行了系统检索,并回顾了2000年至2020年期间发表的研究,这些研究在相同数据集上比较了预测ICU死亡率的二元ML模型与疾病严重程度评分模型的性能。我们评估了研究的特征,综合了结果,对ML模型和疾病严重程度评分模型的判别性能进行了荟萃分析,并对研究内部和研究之间的异质性进行了检验。

结果

我们筛选了461篇摘要,其中评估了66篇(14.3%)文章的全文。我们纳入综述的有20项(4.3%)研究,这些研究基于7种算法开发了47个ML模型,并将它们与3种疾病严重程度评分模型进行了比较。在这20项研究中,发现4项(20%)在模型开发中存在低偏倚风险和适用性,7项(35%)进行了外部验证,9项(45%)报告了校准情况,12项(60%)报告了分类指标,4项(20%)涉及可解释性。基于ML的模型的判别性能(报告为AUROC)在0.728至0.99之间,基于疾病严重程度评分的模型在0.58至0.86之间。我们注意到报告的模型之间存在实质性异质性,并且ML模型和疾病严重程度评分模型类型的AUROC估计值之间存在相当大的差异。

结论

基于ML的模型可以准确预测ICU死亡率,作为传统评分模型的替代方法。尽管ML模型的性能范围优于疾病严重程度评分模型,但由于高度异质性,结果不能一概而论。当在疾病严重程度评分或ML模型之间进行选择以获得决策支持时,临床医生应选择经过外部验证、在实践环境中测试并针对患者群体和实践环境进行更新的模型。

试验注册

PROSPERO CRD42021203871;https://tinyurl.com/28v2nch8 。

相似文献

1
Comparison of Severity of Illness Scores and Artificial Intelligence Models That Are Predictive of Intensive Care Unit Mortality: Meta-analysis and Review of the Literature.预测重症监护病房死亡率的疾病严重程度评分与人工智能模型的比较:荟萃分析与文献综述
JMIR Med Inform. 2022 May 31;10(5):e35293. doi: 10.2196/35293.
2
Dynamic and explainable machine learning prediction of mortality in patients in the intensive care unit: a retrospective study of high-frequency data in electronic patient records.动态可解释机器学习预测 ICU 患者死亡率:电子患者记录中高频数据的回顾性研究。
Lancet Digit Health. 2020 Apr;2(4):e179-e191. doi: 10.1016/S2589-7500(20)30018-2. Epub 2020 Mar 12.
3
Simplified Mortality Score for the Intensive Care Unit (SMS-ICU): protocol for the development and validation of a bedside clinical prediction rule.重症监护病房简化死亡率评分(SMS-ICU):床边临床预测规则的制定与验证方案
BMJ Open. 2017 Mar 9;7(3):e015339. doi: 10.1136/bmjopen-2016-015339.
4
Artificial Intelligence for the Prediction of In-Hospital Clinical Deterioration: A Systematic Review.用于预测住院期间临床病情恶化的人工智能:一项系统综述。
Crit Care Explor. 2022 Aug 26;4(9):e0744. doi: 10.1097/CCE.0000000000000744. eCollection 2022 Sep.
5
Predictive Performance of the Simplified Acute Physiology Score (SAPS) II and the Initial Sequential Organ Failure Assessment (SOFA) Score in Acutely Ill Intensive Care Patients: Post-Hoc Analyses of the SUP-ICU Inception Cohort Study.简化急性生理学评分(SAPS)II和初始序贯器官衰竭评估(SOFA)评分对急重症监护患者的预测性能:SUP-ICU初始队列研究的事后分析
PLoS One. 2016 Dec 22;11(12):e0168948. doi: 10.1371/journal.pone.0168948. eCollection 2016.
6
Evaluation of SOFA-based models for predicting mortality in the ICU: A systematic review.评估基于序贯器官衰竭评估(SOFA)的模型预测重症监护病房(ICU)死亡率:一项系统评价。
Crit Care. 2008;12(6):R161. doi: 10.1186/cc7160. Epub 2008 Dec 17.
7
The future of Cochrane Neonatal.考克兰新生儿协作网的未来。
Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12.
8
Performance of critical care prognostic scoring systems in low and middle-income countries: a systematic review.中低收入国家重症监护预后评分系统的表现:系统评价。
Crit Care. 2018 Jan 26;22(1):18. doi: 10.1186/s13054-017-1930-8.
9
Survival prediction in intensive-care units based on aggregation of long-term disease history and acute physiology: a retrospective study of the Danish National Patient Registry and electronic patient records.基于长期疾病史和急性生理学数据聚合的重症监护病房患者生存预测:丹麦国家患者登记处和电子患者记录的回顾性研究。
Lancet Digit Health. 2019 Jun;1(2):e78-e89. doi: 10.1016/S2589-7500(19)30024-X. Epub 2019 May 23.
10
Artificial Intelligence-Based Models for Prediction of Mortality in ICU Patients: A Scoping Review.基于人工智能的重症监护病房患者死亡率预测模型:一项范围综述
J Intensive Care Med. 2024 Aug 16:8850666241277134. doi: 10.1177/08850666241277134.

引用本文的文献

1
Discriminatory Performance of APACHE II Score and the Prediction of Mortality within the ICU in Patients with Sepsis Admitted to the ICU.急性生理学及慢性健康状况评分系统II(APACHE II)在入住重症监护病房(ICU)的脓毒症患者中的鉴别表现及ICU内死亡率预测
Mater Sociomed. 2025;37(2):153-158. doi: 10.5455/msm.2025.37.153-158.
2
Predicting outcomes using neural networks in the intensive care unit.在重症监护病房使用神经网络预测结果。
World J Clin Cases. 2025 Apr 16;13(11):100966. doi: 10.12998/wjcc.v13.i11.100966.
3
Machine Learning Models for the Early Real-Time Prediction of Deterioration in Intensive Care Units-A Novel Approach to the Early Identification of High-Risk Patients.

本文引用的文献

1
Guidelines and quality criteria for artificial intelligence-based prediction models in healthcare: a scoping review.医疗保健中基于人工智能的预测模型的指南和质量标准:一项范围综述
NPJ Digit Med. 2022 Jan 10;5(1):2. doi: 10.1038/s41746-021-00549-7.
2
Risk of bias in studies on prediction models developed using supervised machine learning techniques: systematic review.基于监督机器学习技术开发的预测模型研究中的偏倚风险:系统评价。
BMJ. 2021 Oct 20;375:n2281. doi: 10.1136/bmj.n2281.
3
Performance Metrics for the Comparative Analysis of Clinical Risk Prediction Models Employing Machine Learning.
用于重症监护病房病情恶化早期实时预测的机器学习模型——一种早期识别高危患者的新方法
J Clin Med. 2025 Jan 8;14(2):350. doi: 10.3390/jcm14020350.
4
Prognosticating the outcome of intensive care in older patients-a narrative review.预测老年患者重症监护的结果——一项叙述性综述。
Ann Intensive Care. 2024 Jun 22;14(1):97. doi: 10.1186/s13613-024-01330-1.
5
AI Quality Standards in Health Care: Rapid Umbrella Review.医疗保健中的人工智能质量标准:快速伞式综述。
J Med Internet Res. 2024 May 22;26:e54705. doi: 10.2196/54705.
6
A Systematic Review of the Application of Artificial Intelligence in Nursing Care: Where are We, and What's Next?人工智能在护理中的应用系统评价:我们现状如何,未来走向何方?
J Multidiscip Healthc. 2024 Apr 12;17:1603-1616. doi: 10.2147/JMDH.S459946. eCollection 2024.
7
Prevalence and risk factors of exposure keratopathy among critically ill patients: A systematic review and meta-analysis.危重症患者暴露性角膜炎的患病率及危险因素:系统评价和荟萃分析。
Nurs Open. 2024 Jan;11(1):e2061. doi: 10.1002/nop2.2061.
8
Machine Learning for Benchmarking Critical Care Outcomes.用于重症监护结果基准测试的机器学习
Healthc Inform Res. 2023 Oct;29(4):301-314. doi: 10.4258/hir.2023.29.4.301. Epub 2023 Oct 31.
9
The Quality and Utility of Artificial Intelligence in Patient Care.人工智能在患者护理中的质量和效用。
Dtsch Arztebl Int. 2023 Jul 10;120(27-28):463-469. doi: 10.3238/arztebl.m2023.0124.
10
D-Dimer as a Prognostic Factor in a Tertiary Center Intensive Coronary Care Unit.D-二聚体作为三级重症监护冠状动脉护理单元的预后因素。
Clin Appl Thromb Hemost. 2022 Jan-Dec;28:10760296221110879. doi: 10.1177/10760296221110879.
机器学习在临床风险预测模型比较分析中的性能指标
Circ Cardiovasc Qual Outcomes. 2021 Oct;14(10):e007526. doi: 10.1161/CIRCOUTCOMES.120.007526. Epub 2021 Oct 4.
4
Generalization in Clinical Prediction Models: The Blessing and Curse of Measurement Indicator Variables.临床预测模型中的泛化:测量指标变量的幸与不幸
Crit Care Explor. 2021 Jun 25;3(7):e0453. doi: 10.1097/CCE.0000000000000453. eCollection 2021 Jul.
5
Comparison of machine learning and logistic regression models in predicting acute kidney injury: A systematic review and meta-analysis.机器学习和逻辑回归模型在预测急性肾损伤中的比较:系统评价和荟萃分析。
Int J Med Inform. 2021 Jul;151:104484. doi: 10.1016/j.ijmedinf.2021.104484. Epub 2021 May 8.
6
The PRISMA 2020 statement: An updated guideline for reporting systematic reviews.PRISMA 2020 声明:系统评价报告的更新指南。
Int J Surg. 2021 Apr;88:105906. doi: 10.1016/j.ijsu.2021.105906. Epub 2021 Mar 29.
7
Performance of intensive care unit severity scoring systems across different ethnicities in the USA: a retrospective observational study.美国不同种族重症监护病房严重程度评分系统的表现:一项回顾性观察研究。
Lancet Digit Health. 2021 Apr;3(4):e241-e249. doi: 10.1016/S2589-7500(21)00022-4.
8
Systematic literature review of machine learning methods used in the analysis of real-world data for patient-provider decision making.系统文献综述机器学习方法在分析真实世界数据中的应用,以支持患者与提供者的决策。
BMC Med Inform Decis Mak. 2021 Feb 15;21(1):54. doi: 10.1186/s12911-021-01403-2.
9
Survival prediction in intensive-care units based on aggregation of long-term disease history and acute physiology: a retrospective study of the Danish National Patient Registry and electronic patient records.基于长期疾病史和急性生理学数据聚合的重症监护病房患者生存预测:丹麦国家患者登记处和电子患者记录的回顾性研究。
Lancet Digit Health. 2019 Jun;1(2):e78-e89. doi: 10.1016/S2589-7500(19)30024-X. Epub 2019 May 23.
10
Comparison of Multivariable Logistic Regression and Other Machine Learning Algorithms for Prognostic Prediction Studies in Pregnancy Care: Systematic Review and Meta-Analysis.多变量逻辑回归与其他机器学习算法在孕期护理预后预测研究中的比较:系统评价与荟萃分析
JMIR Med Inform. 2020 Nov 17;8(11):e16503. doi: 10.2196/16503.