Xiong Fei-Xiang, Sun Lei, Zhang Xue-Jie, Chen Jia-Liang, Zhou Yang, Ji Xiao-Min, Meng Pei-Pei, Wu Tong, Wang Xian-Bo, Hou Yi-Xin
Center of Integrative Medicine, Beijing Ditan Hospital, Capital Medical University, Beijing 100015, China.
Department of Pathology, Beijing Ditan Hospital, Beijing 100015, China.
World J Gastroenterol. 2025 Mar 7;31(9):101383. doi: 10.3748/wjg.v31.i9.101383.
The global prevalence of non-alcoholic steatohepatitis (NASH) and its associated risk of adverse outcomes, particularly in patients with advanced liver fibrosis, underscores the importance of early and accurate diagnosis.
To develop a machine learning-based diagnostic model for advanced liver fibrosis in NASH patients.
A total of 749 patients who underwent liver biopsy at Beijing Ditan Hospital, Capital Medical University, between January 2010 and January 2020 were included. Patients were randomly divided into training ( = 522) and validation ( = 224) cohorts. Five machine learning models were applied to predict advanced liver fibrosis, with feature selection based on Shapley Additive Explanations (SHAP). The diagnostic performance of these models was compared to traditional scores such as the aspartate aminotransferase to platelet ratio index (APRI) and fibrosis index based on the 4 factors (FIB-4), using metrics including the area under the receiver operating characteristic curve (AUROC), decision curve analysis (DCA), and calibration curves.
The Extreme Gradient Boosting (XGBoost) model outperformed all other machine learning models, achieving an AUROC of 0.934 (95%CI: 0.914-0.955) in the training cohort and 0.917 (95%CI: 0.880-0.953) in the validation cohort ( < 0.001). Incorporating liver stiffness measurement into the model further improved its performance, with an AUROC of 0.977 (95%CI: 0.966-0.980) in the training cohort and 0.970 (95%CI: 0.950-0.990) in the validation cohort, significantly surpassing APRI and FIB-4 scores ( < 0.001). The XGBoost model also demonstrated superior clinical utility, as evidenced by DCA and calibration curve analysis in both cohorts.
The XGBoost model provides a highly accurate, non-invasive diagnosis of advanced liver fibrosis in NASH patients, outperforming traditional methods. An online tool based on this model has been developed to assist clinicians in evaluating the risk of advanced liver fibrosis.
非酒精性脂肪性肝炎(NASH)的全球患病率及其不良后果的相关风险,尤其是在晚期肝纤维化患者中,凸显了早期准确诊断的重要性。
开发一种基于机器学习的NASH患者晚期肝纤维化诊断模型。
纳入2010年1月至2020年1月期间在首都医科大学附属北京地坛医院接受肝活检的749例患者。患者被随机分为训练组(n = 522)和验证组(n = 224)。应用五种机器学习模型预测晚期肝纤维化,并基于Shapley加性解释(SHAP)进行特征选择。使用受试者操作特征曲线下面积(AUROC)、决策曲线分析(DCA)和校准曲线等指标,将这些模型的诊断性能与传统评分如天冬氨酸转氨酶与血小板比值指数(APRI)和基于4项因子的纤维化指数(FIB-4)进行比较。
极端梯度提升(XGBoost)模型优于所有其他机器学习模型,在训练组中AUROC为0.934(95%CI:0.914 - 0.955),在验证组中为0.917(95%CI:0.880 - 0.953)(P < 0.001)。将肝脏硬度测量纳入模型进一步提高了其性能,训练组的AUROC为0.977(95%CI:0.966 - 0.980),验证组为0.970(95%CI:0.950 - 0.990),显著超过APRI和FIB-4评分(P < 0.001)。XGBoost模型在两个队列的DCA和校准曲线分析中也显示出卓越的临床实用性。
XGBoost模型为NASH患者晚期肝纤维化提供了高度准确的非侵入性诊断,优于传统方法。基于该模型的在线工具已开发出来,以协助临床医生评估晚期肝纤维化风险。