Jiang Chuan, Xu Zhenyu, Liu Jinqing, Li Ronghua, Chen Keyu, Peng Wenting, Xiao Yueming, Cheng Da, Fu Lei, Peng Shifang
Department of Infectious Diseases, Xiangya Hospital, Central South University, No. 87 Xiangya Road, Changsha, 410008, Hunan, China.
Department of Infectious Diseases, The Second Xiangya Hospital, Central South University, No. 139 Renmin Middle Road, Changsha, 410011, Hunan, China.
Sci Rep. 2025 Jan 2;15(1):571. doi: 10.1038/s41598-024-85012-9.
This study aims to construct and validate noninvasive diagnosis models for evaluating significant liver fibrosis in patients with chronic hepatitis B (CHB). A cohort of 259 CHB patients were selected as research subjects. Through random grouping, 182 cases were included in the training set and 77 cases in the validation set. The nomogram was developed based on univariate analysis and multivariate regression analysis. Various machine learning models were employed to construct prediction models for significant liver fibrosis. The area under the ROC curve (AUC), sensitivity, specificity, NPV, PPV, and F1 score were used to evaluate the diagnostic performance. The new nomogram had excellent diagnostic efficiency (AUC 0.806, 95% CI: 0.740-0.872). Compared with other traditional noninvasive diagnostic models, the nomogram demonstrated higher AUC values and better prediction performance. Among six machine learning models, the random forest (RF) model achieved the highest AUC (AUC 0.819, 95% CI: 0.720-0.906). Finally, the importance of all variables in the RF model was ordered to illustrate the contribution of different variables, providing the clinical factors associated with the risk of significant liver fibrosis. This new nomogram may more reliably than other traditional models and the RF model demonstrated superior accuracy among six machine learning models.
本研究旨在构建并验证用于评估慢性乙型肝炎(CHB)患者显著肝纤维化的无创诊断模型。选取259例CHB患者作为研究对象。通过随机分组,182例纳入训练集,77例纳入验证集。基于单因素分析和多因素回归分析构建列线图。采用多种机器学习模型构建显著肝纤维化的预测模型。采用ROC曲线下面积(AUC)、灵敏度、特异度、阴性预测值、阳性预测值和F1评分评估诊断性能。新列线图具有优异的诊断效率(AUC 0.806,95%CI:0.740 - 0.872)。与其他传统无创诊断模型相比,列线图显示出更高的AUC值和更好的预测性能。在六种机器学习模型中,随机森林(RF)模型的AUC最高(AUC 0.819,95%CI:0.720 - 0.906)。最后,对RF模型中所有变量的重要性进行排序,以说明不同变量的贡献,提供与显著肝纤维化风险相关的临床因素。这种新列线图可能比其他传统模型更可靠,且RF模型在六种机器学习模型中显示出卓越的准确性。