A predictive model for hospital death in cancer patients with acute pulmonary embolism using XGBoost machine learning and SHAP interpretation.

Authors

Yuan Zhen-Nan, Xue Yu-Juan, Wang Hai-Jun, Qu Shi-Ning, Huang Chu-Lin, Wang Hao, Zhang Hao, Zhang Min-Ze, Xing Xue-Zhong

Affiliations

Department of Intensive Care Unit, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 100021, China.

Department of Pediatrics, Peking University People's Hospital, Peking University, Beijing, China.

Publication information

Sci Rep. 2025 May 25;15(1):18268. doi: 10.1038/s41598-025-02072-1.

DOI:10.1038/s41598-025-02072-1
PMID:40414906
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12104392/
Abstract

The prediction of in-hospital mortality in cancer patients with acute pulmonary embolism (APE) remains a significant clinical challenge. This study aimed to develop and validate a machine learning model using XGBoost to predict in-hospital mortality in this vulnerable population. A retrospective cohort study was conducted using the MIMIC-IV 2.2 database and external data from the intensive care unit of Cancer Hospital, Chinese Academy of Medical Sciences, collected between May 1, 2021, and April 30, 2023. A total of 448 cancer patients with APE were included from the MIMIC-IV 2.2 database, divided into a training set (70%, n = 314) and an internal validation set (30%, n = 134). An external validation cohort consisted of 56 patients. An XGBoost model was trained and the SHAP (SHapley Additive Explanations) method was used to identify the top 10 predictors of in-hospital mortality. These predictors included Glasgow Coma Scale (GCS) score, albumin, platelet count, age, serum creatinine, hemoglobin, presence of metastasis, lactate, creatine kinase (CK), and types of cancer. The XGBoost model achieved an area under the ROC curve (AUC) of 0.806 (95% CI: 0.717-0.896) in the internal validation set and 0.724 (95% CI: 0.686-0.901) in the external validation set. Calibration curves indicated good model fit, and decision curve analysis (DCA) demonstrated a high clinical benefit across both the internal and external validation cohorts. The XGBoost model, leveraging SHAP for interpretation, effectively predicts in-hospital mortality in cancer patients with APE. This model provides valuable insights for clinical decision-making and has the potential to improve patient outcomes through early intervention and personalized treatment strategies. Further validation in diverse clinical settings is warranted to confirm its generalizability.
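The abstract reports AUC values with 95% confidence intervals and a decision curve analysis, but does not say how those quantities were computed. The block below is a minimal sketch of one common way to obtain them, under stated assumptions: the data are synthetic (not the study's), the confidence interval uses a percentile bootstrap (an assumption, since the abstract names no method), and the function names `auc`, `bootstrap_ci`, and `net_benefit` are hypothetical helpers introduced only for illustration. It uses the Python standard library only, with no XGBoost or SHAP calls.

```python
import random


def auc(labels, scores):
    """AUC as the probability that a random positive case scores higher
    than a random negative case (ties count as one half)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def bootstrap_ci(labels, scores, n_boot=500, alpha=0.05, seed=0):
    """Percentile bootstrap interval for AUC (assumed method, not the paper's).
    Patients are resampled with replacement; one-class resamples are skipped."""
    rng = random.Random(seed)
    n = len(labels)
    stats = []
    while len(stats) < n_boot:
        idx = [rng.randrange(n) for _ in range(n)]
        ys = [labels[i] for i in idx]
        if 0 < sum(ys) < n:
            stats.append(auc(ys, [scores[i] for i in idx]))
    stats.sort()
    lower = stats[int((alpha / 2) * n_boot)]
    upper = stats[int((1 - alpha / 2) * n_boot) - 1]
    return lower, upper


def net_benefit(labels, scores, threshold):
    """Decision-curve net benefit at a risk threshold pt:
    TP/n - FP/n * pt / (1 - pt)."""
    n = len(labels)
    tp = sum(1 for y, s in zip(labels, scores) if s >= threshold and y == 1)
    fp = sum(1 for y, s in zip(labels, scores) if s >= threshold and y == 0)
    return tp / n - fp / n * threshold / (1 - threshold)


# Synthetic stand-in for a validation set of 134 patients (the size of the
# internal validation set in the abstract); labels and risks are made up.
_rng = random.Random(42)
labels = [1 if _rng.random() < 0.25 else 0 for _ in range(134)]
scores = [min(1.0, max(0.0, _rng.gauss(0.6 if y else 0.35, 0.2))) for y in labels]

point = auc(labels, scores)
low, high = bootstrap_ci(labels, scores)
print(f"AUC = {point:.3f} (95% CI: {low:.3f}-{high:.3f})")
for pt in (0.1, 0.2, 0.3):
    print(f"net benefit at threshold {pt:.1f}: {net_benefit(labels, scores, pt):.3f}")
```

In practice the `scores` list would hold the predicted mortality probabilities from the trained model on a held-out cohort. A decision curve is then the net benefit plotted over a range of thresholds and compared against the treat-all and treat-none strategies.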

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c48/12104392/eb75ca4bbe80/41598_2025_2072_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c48/12104392/e5ef75ec892a/41598_2025_2072_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c48/12104392/b6d5105f5b59/41598_2025_2072_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c48/12104392/de17c419db06/41598_2025_2072_Fig4_HTML.jpg
