Identifying liver cirrhosis in patients with chronic hepatitis B: an interpretable machine learning algorithm based on LSM.
作者信息
Bai Xueting, Pu Chunwen, Zhen Wenchong, Huang Yushuang, Zhang Qian, Li Zihan, Zhang Yixin, Xu Rongxuan, Yao Zhihan, Wu Wei, Sun Mei, Li Xiaofeng
机构信息
Department of Epidemiology and Health Statistics, Dalian Medical University, Dalian, China.
Dalian Public Health Clinical Center, Dalian, Liaoning province, China.
出版信息
Ann Med. 2025 Dec;57(1):2477294. doi: 10.1080/07853890.2025.2477294. Epub 2025 Mar 19.
BACKGROUND
Chronic hepatitis B (CHB) is a common cause of liver cirrhosis (LC), a condition associated with an unfavourable prognosis. Therefore, timely diagnosis of LC in CHB patients is crucial.
OBJECTIVE
This study aimed to enhance the diagnostic accuracy of LC in CHB patients by integrating liver stiffness measurement (LSM) with traditional indicators.
METHODS
The study participants were randomly divided into training and internal validation sets. Employing the least absolute shrinkage and selection operator (LASSO) and random forest-recursive feature elimination (RF-RFE) for feature selection, we developed both traditional logistic regression and five machine learning models (k-nearest neighbors, random forest (RF), artificial neural network, support vector machine and eXtreme Gradient Boosting). Performance evaluation included receiver operating characteristic curves, calibration curves and decision curve analysis. Shapley additive explanations (SHAP) was employed to improve the interpretability of the optimal model.
RESULTS
We retrospectively included 1609 patients with CHB, among whom 470 were diagnosed with cirrhosis. Cirrhosis was diagnosed based on histological confirmation or clinical assessment, supported by characteristic findings on abdominal ultrasound and corroborative evidence such as thrombocytopenia, varices or imaging from CT/MRI. In the internal validation, the RF model achieved an accuracy above 0.80 and an AUC above 0.80, with outstanding calibration ability and clinical net benefit. Additionally, the model exhibited excellent predictive performance in an independent external validation set. The SHAP analysis indicated that LSM contributed the most to the model. The model still showed strong discriminative power when using only LSM or traditional indicators alone.
CONCLUSIONS
Machine learning models, especially the RF model, can effectively identify LC in CHB patients. Integrating LSM with traditional indicators can enhance diagnostic performance.