开发、验证和评估一种简单的机器学习模型以预测肝硬化死亡率。

Development, Validation, and Evaluation of a Simple Machine Learning Model to Predict Cirrhosis Mortality.

机构信息

Section of Gastroenterology and Hepatology, Department of Medicine, Baylor College of Medicine, Houston, Texas.

Health Services Research, Department of Medicine, Baylor College of Medicine, Houston, Texas.

出版信息

JAMA Netw Open. 2020 Nov 2;3(11):e2023780. doi: 10.1001/jamanetworkopen.2020.23780.

DOI:10.1001/jamanetworkopen.2020.23780

PMID:33141161

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7610191/

Abstract

IMPORTANCE

Machine-learning algorithms offer better predictive accuracy than traditional prognostic models but are too complex and opaque for clinical use.

OBJECTIVE

To compare different machine learning methods in predicting overall mortality in cirrhosis and to use machine learning to select easily scored clinical variables for a novel cirrhosis prognostic model.

DESIGN, SETTING, AND PARTICIPANTS: This prognostic study used a retrospective cohort of adult patients with cirrhosis or its complications seen in 130 hospitals and affiliated ambulatory clinics in the integrated, national Veterans Affairs health care system from October 1, 2011, to September 30, 2015. Patients were followed up through December 31, 2018. Data were analyzed from October 1, 2017, to May 31, 2020.

EXPOSURES

Potential predictors included demographic characteristics; liver disease etiology, severity, and complications; use of health care resources; comorbid conditions; and comprehensive laboratory and medication data. Patients were randomly selected for model development (66.7%) and validation (33.3%). Three different statistical and machine learning methods were evaluated: gradient descent boosting, logistic regression with least absolute shrinkage and selection operator (LASSO) regularization, and logistic regression with LASSO constrained to select no more than 10 predictors (partial pathway model). Predictor inclusion and model performance were evaluated in a 5-fold cross-validation. Last, the predictors identified in the most parsimonious (the partial path) model were refit using maximum-likelihood estimation (Cirrhosis Mortality Model [CiMM]), and its predictive performance was compared with that of the widely used Model for End Stage Liver Disease with sodium (MELD-Na) score.

MAIN OUTCOMES AND MEASURES

All-cause mortality.

RESULTS

Of the 107 939 patients with cirrhosis (mean [SD] age, 62.7 [9.6] years; 96.6% male; 66.3% white, 18.4% African American), the annual mortality rate ranged from 8.8% to 15.3%. In total, 32.7% of patients died within 3 years, and 46.2% died within 5 years after the index date. Models predicting 1-year mortality had good discrimination for the gradient descent boosting (area under the receiver operating characteristics curve [AUC], 0.81; 95% CI, 0.80-0.82), logistic regression with LASSO regularization (AUC, 0.78; 95% CI, 0.77-0.79), and the partial path logistic model (AUC, 0.78; 95% CI, 0.76-0.78). All models showed good calibration. The final CiMM model with machine learning-derived clinical variables offered significantly better discrimination than the MELD-Na score, with AUCs of 0.78 (95% CI, 0.77-0.79) vs 0.67 (95% CI, 0.66-0.68) for 1-year mortality, respectively (DeLong z = 17.00; P < .001).

CONCLUSIONS AND RELEVANCE

In this study, simple machine learning techniques performed as well as the more advanced ensemble gradient boosting. Using the clinical variables identified from simple machine learning in a cirrhosis mortality model produced a new score more transparent than machine learning and more predictive than the MELD-Na score.

摘要

重要性

机器学习算法提供了比传统预后模型更好的预测准确性，但对于临床应用来说过于复杂和不透明。

目的

比较不同的机器学习方法在预测肝硬化总体死亡率中的应用，并使用机器学习为新的肝硬化预后模型选择易于评分的临床变量。

设计、地点和参与者：这项预后研究使用了 2011 年 10 月 1 日至 2015 年 9 月 30 日期间在退伍军人事务部综合国家医疗保健系统的 130 家医院和附属门诊诊所中看到的成年肝硬化或其并发症患者的回顾性队列。患者随访至 2018 年 12 月 31 日。数据于 2017 年 10 月 1 日至 2020 年 5 月 31 日进行分析。

暴露

潜在预测因子包括人口统计学特征；肝脏疾病病因、严重程度和并发症；卫生保健资源的使用；合并症；以及综合实验室和药物数据。患者随机选择用于模型开发（66.7%）和验证（33.3%）。评估了三种不同的统计和机器学习方法：梯度下降增强、最小绝对收缩和选择算子（LASSO）正则化的逻辑回归，以及限制选择不超过 10 个预测因子的 LASSO 约束的逻辑回归（部分路径模型）。在 5 折交叉验证中评估了预测因子的纳入和模型性能。最后，使用最大似然估计（肝硬化死亡率模型 [CiMM]）重新拟合在最简约（部分路径）模型中确定的预测因子，并将其预测性能与广泛使用的模型进行比较。用于终末期肝病的钠（MELD-Na）评分。

主要结果和措施

全因死亡率。

结果

在 107939 例肝硬化患者中（平均[标准差]年龄，62.7[9.6]岁；96.6%为男性；66.3%为白人，18.4%为非裔美国人），年死亡率范围为 8.8%至 15.3%。总共有 32.7%的患者在 3 年内死亡，46.2%的患者在指数日期后 5 年内死亡。预测 1 年死亡率的模型对于梯度下降增强（接受者操作特征曲线下面积[AUROC]，0.81；95%CI，0.80-0.82）、具有 LASSO 正则化的逻辑回归（AUROC，0.78；95%CI，0.77-0.79）和部分路径逻辑模型具有良好的判别能力（AUROC，0.78；95%CI，0.76-0.78）。所有模型均显示出良好的校准。使用机器学习衍生的临床变量的最终 CiMM 模型与 MELD-Na 评分相比，提供了显著更好的鉴别能力，1 年死亡率的 AUC 分别为 0.78（95%CI，0.77-0.79）和 0.67（95%CI，0.66-0.68）（DeLong z=17.00；P<0.001）。

结论和相关性

在这项研究中，简单的机器学习技术与更先进的集成梯度增强一样有效。使用简单机器学习从肝硬化死亡率模型中确定的临床变量生成了一个新的评分，该评分比机器学习更透明，比 MELD-Na 评分更具预测性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0bf/7610191/f9e73e402ac0/jamanetwopen-e2023780-g001.jpg

相似文献

Development, Validation, and Evaluation of a Simple Machine Learning Model to Predict Cirrhosis Mortality.

JAMA Netw Open. 2020 Nov 2;3(11):e2023780. doi: 10.1001/jamanetworkopen.2020.23780.

Prediction of acute kidney injury in patients with liver cirrhosis using machine learning models: evidence from the MIMIC-III and MIMIC-IV.

Int Urol Nephrol. 2024 Jan;56(1):237-247. doi: 10.1007/s11255-023-03646-6. Epub 2023 May 31.

Predicting mortality among patients with liver cirrhosis in electronic health records with machine learning.

PLoS One. 2021 Aug 31;16(8):e0256428. doi: 10.1371/journal.pone.0256428. eCollection 2021.

Can Predictive Modeling Tools Identify Patients at High Risk of Prolonged Opioid Use After ACL Reconstruction?

Clin Orthop Relat Res. 2020 Jul;478(7):0-1618. doi: 10.1097/CORR.0000000000001251.

A revised scope in different prognostic models in cirrhotic patients: Current and future perspectives, an Egyptian experience.

Arab J Gastroenterol. 2013 Dec;14(4):158-64. doi: 10.1016/j.ajg.2013.08.007. Epub 2013 Sep 29.

Machine Learning Approaches to Predict 6-Month Mortality Among Patients With Cancer.

JAMA Netw Open. 2019 Oct 2;2(10):e1915997. doi: 10.1001/jamanetworkopen.2019.15997.

Low Predictability of Readmissions and Death Using Machine Learning in Cirrhosis.

Am J Gastroenterol. 2021 Feb 1;116(2):336-346. doi: 10.14309/ajg.0000000000000971.

Can Machine-learning Algorithms Predict Early Revision TKA in the Danish Knee Arthroplasty Registry?

Clin Orthop Relat Res. 2020 Sep;478(9):2088-2101. doi: 10.1097/CORR.0000000000001343.

Comparative Effectiveness of Machine Learning Approaches for Predicting Gastrointestinal Bleeds in Patients Receiving Antithrombotic Treatment.

JAMA Netw Open. 2021 May 3;4(5):e2110703. doi: 10.1001/jamanetworkopen.2021.10703.

The Refit model for end-stage liver disease-Na is not a better predictor of mortality than the Refit model for end-stage liver disease in patients with cirrhosis and ascites.

Clin Mol Hepatol. 2014 Mar;20(1):47-55. doi: 10.3350/cmh.2014.20.1.47. Epub 2014 Mar 26.

引用本文的文献

Sina Score as a New Machine Learning-Derived Online Prediction Model of Mortality for Cirrhotic Patients Awaiting Liver Transplantation: A Prospective Cohort Study.

J Clin Med. 2025 Jun 27;14(13):4559. doi: 10.3390/jcm14134559.

Coagulation and Transfusion Informatics in Chronic Liver Disease: A Data Linkage Study of Emergency Department Presentations.

EJHaem. 2025 Jul 10;6(4):e70101. doi: 10.1002/jha2.70101. eCollection 2025 Aug.

Sarcopenia and frailty: An in-depth analysis of the pathophysiology and effect on liver transplant candidates.

World J Hepatol. 2025 May 27;17(5):106182. doi: 10.4254/wjh.v17.i5.106182.

Machine learning in solid organ transplantation: Charting the evolving landscape.

World J Transplant. 2025 Mar 18;15(1):99642. doi: 10.5500/wjt.v15.i1.99642.

Establishment and validation of a nomogram for predicting esophagogastric variceal bleeding in patients with liver cirrhosis.

World J Gastroenterol. 2025 Mar 7;31(9):102714. doi: 10.3748/wjg.v31.i9.102714.

The Role of Machine Learning Models in Predicting Cirrhosis Mortality: A Systematic Review.

Cureus. 2025 Jan 28;17(1):e78155. doi: 10.7759/cureus.78155. eCollection 2025 Jan.

Artificial intelligence-based evaluation of prognosis in cirrhosis.

J Transl Med. 2024 Oct 14;22(1):933. doi: 10.1186/s12967-024-05726-2.

Development and validation of a predictive model for prolonged length of stay in elderly type 2 diabetes mellitus patients combined with cerebral infarction.

Front Neurol. 2024 Aug 1;15:1405096. doi: 10.3389/fneur.2024.1405096. eCollection 2024.

Artificial Intelligence and the Future of Gastroenterology and Hepatology.

Gastro Hep Adv. 2022 May 11;1(4):581-595. doi: 10.1016/j.gastha.2022.02.025. eCollection 2022.

Development and validation of prediction models for nosocomial infection and prognosis in hospitalized patients with cirrhosis.

Antimicrob Resist Infect Control. 2024 Aug 7;13(1):85. doi: 10.1186/s13756-024-01444-y.

本文引用的文献

Stop Explaining Black Box Machine Learning Models for High Stakes Decisions and Use Interpretable Models Instead.

Nat Mach Intell. 2019 May;1(5):206-215. doi: 10.1038/s42256-019-0048-x. Epub 2019 May 13.

Model for End-Stage Liver Disease-Lactate and Prediction of Inpatient Mortality in Patients With Chronic Liver Disease.

Hepatology. 2020 Nov;72(5):1747-1757. doi: 10.1002/hep.31199.

Development of a national Department of Veterans Affairs mortality risk prediction model among patients with cirrhosis.

BMJ Open Gastroenterol. 2019 Nov 26;6(1):e000342. doi: 10.1136/bmjgast-2019-000342. eCollection 2019.

Risk prediction scores for acute on chronic liver failure development and mortality.

Liver Int. 2020 May;40(5):1159-1167. doi: 10.1111/liv.14328. Epub 2019 Dec 26.

Integrated Model for Patient-Centered Advanced Liver Disease Care.

Clin Gastroenterol Hepatol. 2020 May;18(5):1015-1024. doi: 10.1016/j.cgh.2019.07.043. Epub 2019 Jul 26.

Oblique Survival Trees in Discrete Event Time Analysis.

IEEE J Biomed Health Inform. 2020 Jan;24(1):247-258. doi: 10.1109/JBHI.2019.2908773. Epub 2019 Apr 1.

Development of Quality Measures in Cirrhosis by the Practice Metrics Committee of the American Association for the Study of Liver Diseases.

Hepatology. 2019 Apr;69(4):1787-1797. doi: 10.1002/hep.30489. Epub 2019 Mar 12.

Hazards of Hazard Ratios - Deviations from Model Assumptions in Immunotherapy.

N Engl J Med. 2018 Mar 22;378(12):1158-1159. doi: 10.1056/NEJMc1716612.

Beyond discrimination: A comparison of calibration methods and clinical usefulness of predictive models of readmission risk.

J Biomed Inform. 2017 Dec;76:9-18. doi: 10.1016/j.jbi.2017.10.008. Epub 2017 Oct 24.

Machine Learning and Prediction in Medicine - Beyond the Peak of Inflated Expectations.

N Engl J Med. 2017 Jun 29;376(26):2507-2509. doi: 10.1056/NEJMp1702071.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

开发、验证和评估一种简单的机器学习模型以预测肝硬化死亡率。

Development, Validation, and Evaluation of a Simple Machine Learning Model to Predict Cirrhosis Mortality.

机构信息

Section of Gastroenterology and Hepatology, Department of Medicine, Baylor College of Medicine, Houston, Texas.

Health Services Research, Department of Medicine, Baylor College of Medicine, Houston, Texas.