Centre for Data Science and Digital Health, Hamilton Health Sciences, Hamilton, ON, Canada.
DeGroote School of Business, McMaster University, Hamilton, ON, Canada.
J Med Internet Res. 2021 Feb 4;23(2):e25187. doi: 10.2196/25187.
Timely identification of patients at a high risk of clinical deterioration is key to prioritizing care, allocating resources effectively, and preventing adverse outcomes. Vital signs-based, aggregate-weighted early warning systems are commonly used to predict the risk of outcomes related to cardiorespiratory instability and sepsis, which are strong predictors of poor outcomes and mortality. Machine learning models, which can incorporate trends and capture relationships among parameters that aggregate-weighted models cannot, have recently been showing promising results.
This study aimed to identify, summarize, and evaluate the available research, current state of utility, and challenges with machine learning-based early warning systems using vital signs to predict the risk of physiological deterioration in acutely ill patients, across acute and ambulatory care settings.
PubMed, CINAHL, Cochrane Library, Web of Science, Embase, and Google Scholar were searched for peer-reviewed, original studies with keywords related to "vital signs," "clinical deterioration," and "machine learning." Included studies used patient vital signs along with demographics and described a machine learning model for predicting an outcome in acute and ambulatory care settings. Data were extracted following PRISMA, TRIPOD, and Cochrane Collaboration guidelines.
We identified 24 peer-reviewed studies from 417 articles for inclusion; 23 studies were retrospective, while 1 was prospective in nature. Care settings included general wards, intensive care units, emergency departments, step-down units, medical assessment units, postanesthetic wards, and home care. Machine learning models including logistic regression, tree-based methods, kernel-based methods, and neural networks were most commonly used to predict the risk of deterioration. The area under the curve for models ranged from 0.57 to 0.97.
In studies that compared performance, reported results suggest that machine learning-based early warning systems can achieve greater accuracy than aggregate-weighted early warning systems but several areas for further research were identified. While these models have the potential to provide clinical decision support, there is a need for standardized outcome measures to allow for rigorous evaluation of performance across models. Further research needs to address the interpretability of model outputs by clinicians, clinical efficacy of these systems through prospective study design, and their potential impact in different clinical settings.
及时识别临床恶化风险高的患者是优先提供护理、有效分配资源和预防不良结局的关键。基于生命体征的综合加权预警系统常用于预测与心肺不稳定和脓毒症相关结局的风险,这些是不良结局和死亡率的强预测指标。最近,机器学习模型在捕捉综合加权模型无法捕捉的参数趋势和关系方面显示出了有前景的结果。
本研究旨在识别、总结和评估基于生命体征的机器学习预警系统在预测急性和门诊护理环境中急性病患者生理恶化风险方面的现有研究、当前效用状态和挑战。
使用与“生命体征”、“临床恶化”和“机器学习”相关的关键词,在 PubMed、CINAHL、Cochrane 图书馆、Web of Science、Embase 和 Google Scholar 中搜索同行评议的原始研究。纳入的研究使用患者生命体征以及人口统计学数据,并描述了用于预测急性和门诊护理环境中结局的机器学习模型。数据按照 PRISMA、TRIPOD 和 Cochrane 协作指南进行提取。
我们从 417 篇文章中确定了 24 篇符合纳入标准的同行评议研究;23 项研究为回顾性,1 项为前瞻性。护理环境包括普通病房、重症监护病房、急诊科、降级病房、医疗评估病房、麻醉后病房和家庭护理。最常用于预测恶化风险的机器学习模型包括逻辑回归、基于树的方法、核方法和神经网络。模型的曲线下面积范围为 0.57 至 0.97。
在比较性能的研究中,报告的结果表明,基于机器学习的预警系统可以比综合加权预警系统实现更高的准确性,但确定了几个进一步研究的领域。虽然这些模型有可能提供临床决策支持,但需要标准化的结局衡量标准,以便在模型之间进行严格的性能评估。还需要进一步研究解决临床医生对模型输出的解释能力、这些系统的临床疗效通过前瞻性研究设计以及它们在不同临床环境中的潜在影响。