Department of Mechatronics, School of Intelligent Systems, College of Interdisciplinary Science and Technology, University of Tehran, Tehran, Iran.
School of Science and Technology, Faculty of Science, Agriculture, Business and Law, University of New England, Armidale, NSW 2350, Australia.
Int J Med Inform. 2025 Jan;193:105659. doi: 10.1016/j.ijmedinf.2024.105659. Epub 2024 Oct 28.
Early and reliable prognostication in post-cardiac arrest patients remains challenging, with various factors linked to return of spontaneous circulation (ROSC), survival, and neurological results. Machine learning and deep learning models show promise in improving these predictions. This systematic review and meta-analysis evaluates how effective these approaches are in predicting clinical outcomes at different time points using structured data.
This study followed PRISMA guidelines, involving a comprehensive search across PubMed, Scopus, and Web of Science databases until March 2024. Studies aimed at predicting ROSC, survival (or mortality), and neurological outcomes after cardiac arrest through the application of machine learning or deep learning techniques with structured data were included. Data extraction followed the guidelines of the CHARMS checklist, and the bias risk was evaluated using PROBAST tool. Models reporting the AUC metric with 95 % confidence intervals were incorporated into the quantitative synthesis and meta-analysis.
After extracting 2,753 initial records, 41 studies met the inclusion criteria, yielding 97 machine learning and 16 deep learning models. The pooled AUC for predicting favorable neurological outcomes (CPC 1 or 2) at hospital discharge was 0.871 (95 % CI: 0.813 - 0.928) for machine learning models and 0.877 (95 % CI: 0.831-0.924) across deep learning algorithms. For survival prediction, this value was found to be 0.837 (95 % CI: 0.757-0.916). Considerable heterogeneity and high risk of bias were observed, mainly attributable to inadequate management of missing data and the absence of calibration plots. Most studies focused on pre-hospital factors, with age, sex, and initial arrest rhythm being the most frequent features.
Predictive models utilizing AI-based approaches, including machine and deep learning models exhibit enhanced effectiveness compared to previous regression algorithms, but significant heterogeneity and high risk of bias limit their dependability. Evaluating state-of-the-art deep learning models tailored for tabular data and their clinical generalizability can enhance outcome prediction after cardiac arrest.
心脏骤停后进行早期、可靠的预后预测仍然具有挑战性,各种因素与自主循环恢复(ROSC)、存活和神经结果相关。机器学习和深度学习模型在改善这些预测方面显示出了希望。本系统评价和荟萃分析评估了这些方法在使用结构化数据的不同时间点预测临床结果方面的有效性。
本研究遵循 PRISMA 指南,对 PubMed、Scopus 和 Web of Science 数据库进行了全面检索,截至 2024 年 3 月。纳入的研究旨在通过应用机器学习或深度学习技术并结合结构化数据来预测心脏骤停后 ROSC、存活(或死亡率)和神经结果。数据提取遵循 CHARMS 清单指南,使用 PROBAST 工具评估偏倚风险。纳入了报告 AUC 指标及其 95%置信区间的模型,并进行了定量综合和荟萃分析。
在提取了 2753 条初始记录后,有 41 项研究符合纳入标准,其中包含 97 个机器学习模型和 16 个深度学习模型。机器学习模型预测出院时良好神经结局(CPC 1 或 2)的汇总 AUC 为 0.871(95%CI:0.813-0.928),深度学习算法的 AUC 为 0.877(95%CI:0.831-0.924)。对于生存预测,这个值为 0.837(95%CI:0.757-0.916)。观察到明显的异质性和高偏倚风险,主要归因于对缺失数据的管理不当和缺乏校准图。大多数研究侧重于院前因素,最常见的特征是年龄、性别和初始骤停节律。
基于人工智能的预测模型,包括机器学习和深度学习模型,与以前的回归算法相比,表现出更高的效果,但显著的异质性和高偏倚风险限制了它们的可靠性。评估针对表格数据的最新深度学习模型及其临床通用性可以提高心脏骤停后的预后预测。