• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于预测病房患者临床病情恶化的多模态深度学习方法比较:观察性队列研究

Comparison of Multimodal Deep Learning Approaches for Predicting Clinical Deterioration in Ward Patients: Observational Cohort Study.

作者信息

Kotula Charles A, Martin Jennie, Carey Kyle A, Edelson Dana P, Dligach Dmitriy, Mayampurath Anoop, Afshar Majid, Churpek Matthew M

机构信息

Department of Medicine, University of Wisconsin-Madison, 610 Walnut St, Madison, WI, 53792, United States, 1 608-262-9564.

Department of Medicine, University of Chicago, Chicago, IL, United States.

出版信息

J Med Internet Res. 2025 Jun 11;27:e75340. doi: 10.2196/75340.

DOI:10.2196/75340
PMID:40499139
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12176310/
Abstract

BACKGROUND

Implementing machine learning models to identify clinical deterioration in the wards is associated with decreased morbidity and mortality. However, these models have high false positive rates and only use structured data.

OBJECTIVE

We aimed to compare models with and without information from clinical notes for predicting deterioration.

METHODS

Adults admitted to the wards at the University of Chicago (development cohort) and University of Wisconsin-Madison (external validation cohort) were included. Predictors consisted of structured and unstructured variables extracted from notes as concept unique identifiers (CUIs). We parameterized CUIs in 5 ways: standard tokenization (ST), International Classification of Diseases rollup using tokenization (ICDR-T), International Classification of Diseases rollup using binary variables (ICDR-BV), concept unique identifiers as SapBERT embedding (SE), and concept unique identifier clustering using SapBERT embeddings (CC). Each parameterization method combined with structured data and each structured data-only method were compared for predicting intensive care unit transfer or death in the next 24 hours using deep recurrent neural networks.

RESULTS

The development (University of Chicago) cohort included 284,302 patients, while the external validation (University of Wisconsin-Madison) cohort included 248,055 patients. In total, 4.9% (n=26,281) of patients experienced the outcome. The SE model achieved the highest area under the precision-recall curve (0.208), followed by CC (0.199) and the structured-only model (0.199), ICDR-BV (0.194), ICDR-T (0.166), and ST (0.158). The CC and structured-only models achieved the highest area under the receiver operating characteristic (0.870), followed by ICDR-T (0.867), ICDR-BV (0.866), ST (0.860), and SE (0.859). Regarding sensitivity and positive predictive value, the CC model achieved the greatest positive predictive value (12.53%) and sensitivity (52.15%) at the cutoff that flagged 5% of the observations in the test set. At the 15% cutoff, the ICDR-T, CC, and ICDR-BV models tied for the highest positive predictive value at 5.67%, while their sensitivities were 70.95%, 70.92%, and 70.86%, respectively. All models were well calibrated, achieving Brier scores in the range of 0.011-0.012. The modified integrated gradients method revealed that CUIs corresponding to terms such as "NPO - nothing by mouth," "chemotherapy," "transplanted tissue," and "dialysis procedure" were most predictive of deterioration.

CONCLUSIONS

A multimodal model combining structured data with embeddings using SapBERT had the highest area under the precision-recall curve, but performance was similar between models with and without CUIs. Although the addition of CUIs from notes to structured data did not meaningfully improve model performance for predicting clinical deterioration, models using CUIs could provide clinicians with relevant information and additional clinical context for supporting decision-making.

摘要

背景

在病房中应用机器学习模型来识别临床病情恶化与发病率和死亡率的降低相关。然而,这些模型的假阳性率很高,并且仅使用结构化数据。

目的

我们旨在比较有无临床记录信息的模型在预测病情恶化方面的表现。

方法

纳入芝加哥大学(开发队列)和威斯康星大学麦迪逊分校(外部验证队列)病房收治的成年患者。预测变量包括从病历中提取的结构化和非结构化变量,以概念唯一标识符(CUI)表示。我们以5种方式对CUI进行参数化:标准词元化(ST)、使用词元化的国际疾病分类汇总(ICDR-T)、使用二元变量的国际疾病分类汇总(ICDR-BV)、作为SapBERT嵌入的概念唯一标识符(SE)以及使用SapBERT嵌入的概念唯一标识符聚类(CC)。使用深度循环神经网络,比较每种参数化方法与结构化数据相结合的情况以及每种仅使用结构化数据的方法,以预测未来24小时内的重症监护病房转诊或死亡情况。

结果

开发队列(芝加哥大学)包括284,302例患者,而外部验证队列(威斯康星大学麦迪逊分校)包括248,055例患者。共有4.9%(n = 26,281)的患者出现了该结局。SE模型在精确率-召回率曲线下面积最高(0.208),其次是CC(0.199)和仅使用结构化数据的模型(0.199)、ICDR-BV(0.194)、ICDR-T(0.166)和ST(0.158)。CC模型和仅使用结构化数据的模型在受试者工作特征曲线下面积最高(0.870),其次是ICDR-T(0.867)、ICDR-BV(0.866)、ST(0.860)和SE(0.859)。关于敏感性和阳性预测值,在标记测试集中5%观察值的截断值时,CC模型的阳性预测值最高(12.53%)且敏感性最高(52.15%)。在15%的截断值时,ICDR-T、CC和ICDR-BV模型的阳性预测值并列最高,为5.67%,而它们的敏感性分别为70.95%、70.92%和70.86%。所有模型校准良好,布里尔分数在0.011 - 0.012范围内。改进的综合梯度法显示,与“禁食 - 无口进食”“化疗”“移植组织”和“透析程序”等术语对应的CUI对病情恶化的预测性最强。

结论

将结构化数据与使用SapBERT的嵌入相结合的多模态模型在精确率-召回率曲线下面积最高,但有无CUI的模型之间性能相似。虽然将病历中的CUI添加到结构化数据中并不能显著提高预测临床病情恶化的模型性能,但使用CUI的模型可为临床医生提供相关信息和额外的临床背景以支持决策。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8243/12176310/5efaabd8ceed/jmir-v27-e75340-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8243/12176310/b763e69cb8b6/jmir-v27-e75340-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8243/12176310/dbfa1e76b2bd/jmir-v27-e75340-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8243/12176310/5efaabd8ceed/jmir-v27-e75340-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8243/12176310/b763e69cb8b6/jmir-v27-e75340-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8243/12176310/dbfa1e76b2bd/jmir-v27-e75340-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8243/12176310/5efaabd8ceed/jmir-v27-e75340-g003.jpg

相似文献

1
Comparison of Multimodal Deep Learning Approaches for Predicting Clinical Deterioration in Ward Patients: Observational Cohort Study.用于预测病房患者临床病情恶化的多模态深度学习方法比较:观察性队列研究
J Med Internet Res. 2025 Jun 11;27:e75340. doi: 10.2196/75340.
2
Comparison of Multimodal Deep Learning Approaches for Predicting Clinical Deterioration in Ward Patients.用于预测病房患者临床病情恶化的多模态深度学习方法比较
medRxiv. 2025 Mar 7:2025.03.06.25322855. doi: 10.1101/2025.03.06.25322855.
3
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
4
Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?当前的生存预测工具在治疗骨转移后的骨骼相关事件时有用吗?
Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.
5
Does the Presence of Missing Data Affect the Performance of the SORG Machine-learning Algorithm for Patients With Spinal Metastasis? Development of an Internet Application Algorithm.缺失数据的存在是否会影响 SORG 机器学习算法在脊柱转移瘤患者中的性能?开发一种互联网应用算法。
Clin Orthop Relat Res. 2024 Jan 1;482(1):143-157. doi: 10.1097/CORR.0000000000002706. Epub 2023 Jun 12.
6
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
7
Development and Validation of a Convolutional Neural Network Model to Predict a Pathologic Fracture in the Proximal Femur Using Abdomen and Pelvis CT Images of Patients With Advanced Cancer.利用晚期癌症患者腹部和骨盆 CT 图像建立卷积神经网络模型预测股骨近端病理性骨折的研究
Clin Orthop Relat Res. 2023 Nov 1;481(11):2247-2256. doi: 10.1097/CORR.0000000000002771. Epub 2023 Aug 23.
8
Artificial intelligence for diagnosing exudative age-related macular degeneration.人工智能在渗出性年龄相关性黄斑变性诊断中的应用。
Cochrane Database Syst Rev. 2024 Oct 17;10(10):CD015522. doi: 10.1002/14651858.CD015522.pub2.
9
Supervised Machine Learning Models for Predicting Sepsis-Associated Liver Injury in Patients With Sepsis: Development and Validation Study Based on a Multicenter Cohort Study.用于预测脓毒症患者脓毒症相关肝损伤的监督式机器学习模型:基于多中心队列研究的开发与验证研究
J Med Internet Res. 2025 May 26;27:e66733. doi: 10.2196/66733.
10
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益
Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.

本文引用的文献

1
Multicenter Development and Prospective Validation of eCARTv5: A Gradient-Boosted Machine-Learning Early Warning Score.eCARTv5的多中心开发与前瞻性验证:一种梯度提升机器学习早期预警评分
Crit Care Explor. 2025 Mar 26;7(4):e1232. doi: 10.1097/CCE.0000000000001232. eCollection 2025 Apr 1.
2
Improving the Prognostic Evaluation Precision of Hospital Outcomes for Heart Failure Using Admission Notes and Clinical Tabular Data: Multimodal Deep Learning Model.利用入院记录和临床表格数据提高心力衰竭住院结局预后评估的精准度:多模态深度学习模型。
J Med Internet Res. 2024 May 2;26:e54363. doi: 10.2196/54363.
3
Multimodal risk prediction with physiological signals, medical images and clinical notes.
利用生理信号、医学图像和临床记录进行多模态风险预测。
Heliyon. 2024 Feb 28;10(5):e26772. doi: 10.1016/j.heliyon.2024.e26772. eCollection 2024 Mar 15.
4
Development and external validation of multimodal postoperative acute kidney injury risk machine learning models.多模式术后急性肾损伤风险机器学习模型的开发与外部验证
JAMIA Open. 2023 Dec 22;6(4):ooad109. doi: 10.1093/jamiaopen/ooad109. eCollection 2023 Dec.
5
Systematic review and longitudinal analysis of implementing Artificial Intelligence to predict clinical deterioration in adult hospitals: what is known and what remains uncertain.系统评价和纵向分析在成人医院实施人工智能预测临床恶化:已知和未知。
J Am Med Inform Assoc. 2024 Jan 18;31(2):509-524. doi: 10.1093/jamia/ocad220.
6
Combining clinical notes with structured electronic health records enhances the prediction of mental health crises.将临床笔记与结构化电子健康记录相结合,可以提高心理健康危机预测的准确性。
Cell Rep Med. 2023 Nov 21;4(11):101260. doi: 10.1016/j.xcrm.2023.101260. Epub 2023 Oct 31.
7
Deep learning for deterioration prediction of COVID-19 patients based on time-series of three vital signs.基于生命体征时间序列的 COVID-19 患者恶化预测的深度学习方法。
Sci Rep. 2023 Jun 20;13(1):9968. doi: 10.1038/s41598-023-37013-3.
8
Deployment of Real-time Natural Language Processing and Deep Learning Clinical Decision Support in the Electronic Health Record: Pipeline Implementation for an Opioid Misuse Screener in Hospitalized Adults.电子健康记录中实时自然语言处理和深度学习临床决策支持的应用:成年住院患者阿片类药物滥用筛查器的流程实施
JMIR Med Inform. 2023 Apr 20;11:e44977. doi: 10.2196/44977.
9
Comparison of Natural Language Processing of Clinical Notes With a Validated Risk-Stratification Tool to Predict Severe Maternal Morbidity.临床记录的自然语言处理与验证的风险分层工具预测严重产妇发病率的比较。
JAMA Netw Open. 2022 Oct 3;5(10):e2234924. doi: 10.1001/jamanetworkopen.2022.34924.
10
The Impact of a Machine Learning Early Warning Score on Hospital Mortality: A Multicenter Clinical Intervention Trial.机器学习早期预警评分对医院死亡率的影响:一项多中心临床干预试验。
Crit Care Med. 2022 Sep 1;50(9):1339-1347. doi: 10.1097/CCM.0000000000005492. Epub 2022 Aug 15.