Department of Family Medicine, School of Medicine, College of Medicine, Taipei Medical University.
Department of Family Medicine, Taipei Medical University Hospital.
Eur J Gastroenterol Hepatol. 2021 Aug 1;33(8):1117-1123. doi: 10.1097/MEG.0000000000002169.
End-stage liver disease is a global public health problem with a high mortality rate. Early identification of people at risk of poor prognosis is fundamental for decision-making in clinical settings. This study created a machine learning prediction system that provides several related models with visualized graphs, including decision trees, ensemble learning and clustering, to predict mortality in patients with end-stage liver disease.
A retrospective cohort study was conducted: the training data were from patients enrolled from January 2009 to December 2010 and followed up to December 2014; validation data were from patients enrolled from January 2015 to December 2016 and followed up to January 2019. Hospitalized patients with noncancer-related chronic liver disease were identified from the hospital's electrical medical records.
In traditional multivariable logistic regression and Cox proportional hazard model, prothrombin time of international normalized ratio, which was significant with P value = 0.002, odds ratio = 2.790 and hazard ratio 1.363. Besides, blood urea nitrogen and C-reactive protein were also significant, with P value <0.001 and 0.026. The area under the curve was 0.771 in the receiver operating characteristic curve. In machine learning, blood urea nitrogen and age were regarded as the primary factors for predicting mortality. Creatinine, prothrombin time of international normalized ratio and bilirubin were also significant mortality predictors. The area under the curve of the random forest and AdaBoost was 0.838 and 0.792.
The machine learning techniques provided a comprehensive assessment of patient conditions; it could help physicians make an accurate diagnosis of chronic liver disease and improve healthcare management.
终末期肝病是一个具有高死亡率的全球性公共卫生问题。早期识别预后不良风险的人群对于临床决策至关重要。本研究创建了一个机器学习预测系统,提供了几个相关模型的可视化图形,包括决策树、集成学习和聚类,以预测终末期肝病患者的死亡率。
这是一项回顾性队列研究:训练数据来自 2009 年 1 月至 2010 年 12 月期间入组并随访至 2014 年 12 月的患者;验证数据来自 2015 年 1 月至 2016 年 12 月期间入组并随访至 2019 年 1 月的患者。从医院电子病历中确定非癌症相关慢性肝病住院患者。
在传统的多变量逻辑回归和 Cox 比例风险模型中,国际标准化比值的凝血酶原时间具有统计学意义(P 值=0.002,比值比=2.790,危险比=1.363)。此外,血尿素氮和 C 反应蛋白也具有统计学意义(P 值均<0.001 和 0.026)。受试者工作特征曲线下面积为 0.771。在机器学习中,血尿素氮和年龄被视为预测死亡率的主要因素。肌酐、国际标准化比值的凝血酶原时间和胆红素也是重要的死亡率预测因素。随机森林和 AdaBoost 的曲线下面积分别为 0.838 和 0.792。
机器学习技术提供了对患者病情的全面评估;它可以帮助医生对慢性肝病做出准确的诊断,并改善医疗保健管理。